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ALIGNMENTS 



RESULT 1 
AAU00438 

ID AAU00438 standard; protein; 425 AA. 

XX 

AC AAU00438; 
XX 

DT 17-MAY-2001 (first entry) 
XX 

DE Human neuropeptide receptor. 
XX 

KW Human; neuropeptide receptor; neuropeptide Y receptor; obesity; 

KW nervous system disorder; hyperprolif erative disorder; diabetes mellitus; 

KW cardiovascular disorder; autoimmune disorder; infectious disorder; 

KW eating behaviour disorder; narcolepsy; neurological disease; 

KW narcotics addiction; nicotine addiction; alcohol addiction; gene therapy; 

KW protein co-ordinate data. 



XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Region 47. .72 

FT /label= TMl 

FT /note= "Transmembrane region 1" 

FT Region 83. .106 

FT /label= TM2 

FT /note= "Transmembrane region 2" 

FT Region 112. .142 

FT /label= TM3 

FT /note= "Transmembrane region 3" 

FT Region 163. .189 

FT /label= TM4 

FT /note= "Transmembrane region 4" 

FT Region 214. .239 

FT /label= TM5 

FT /note= "Transmembrane region 5" 

FT Region 299. .327 

FT /label= TM6 

FT ' /note= "Transmembrane region 6" 

FT Region 335. .363 

FT /iabel= TM7 

FT /note= "Transmembrane region 7" 
XX 

PN WO200117532-A1. 
XX 

PD 15-MAR-2001. 
XX 

PF 07-SEP-2000; 2000WO-US024518 . 
XX 

PR lO-SEP-1999; 99US-00393696 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Soppet DR, Li Y, Rosen CA; 
XX 

DR WPI; 2001-183276/18. 

DR N-PSDB; AAS00491. 
XX 

PT A new nucleic acid encoding a human neuropeptide receptor polypeptide, 

PT useful for preventing, treating or ameliorating obesity, narcolepsy, 

PT neurological disease and addiction to narcotics, nicotine and alcohol. 
XX 

PS Claim 3; Fig 4; 385pp; English. 
XX 

CC The present sequence represents a novel human neuropeptide receptor which 

CC shows sequence homology to the neuropeptide Y receptor. Two splice 

CC variants of the neuropeptide receptor (AAU00439-AAU0044 0) and a possible 

CC mutant (AAU00442) are also described. Polypeptides and polynucleotides of 

CC the neuropeptide receptor are useful for diagnosing, preventing, or 

CC treating a pathological condition in a subject related to the central . 

CC nervous and peripheral nervous systems (CNS and PNS) . The polypeptides 

CC and polynucleotides may be used to treat hyperprolif erative, 

CC cardiovascular, autoimmune, nervous system or infectious disorders e.g. 

CC cancer, heart disease, rheumatoid arthritis, Alzheimer's disease, HIV 



CC infection and diabetes mellitus. In particular they are useful for 

CC preventing, treating or ameliorating a medical condition in a mammal such 

CC as obesity/eating behaviour disorders, narcolepsy, neurological disease, 

CC addiction to narcotics, nicotine and alcohol, chronic pain, acute pain, 

CC migraine headaches and anxiety disorders. The polynucleotides encoding 

CC the neuropeptide receptor can also be used in gene therapy methods for 

CC treating such diseases 
XX 

SQ Sequence 425 AA; 

Query Match 100.0%; Score 2219; DB 4; Length 425; 

Best Local Similarity 100.0%; Pred. No. l.le-221; 

Matches 425; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVEWA 60 

I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

LVGNTLVCIAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
LVGNTLVCIAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

MWLLVFALCYLPI SVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPII YNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 



I I I I 
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RESULT 2 
AAB67489 

ID AAB67489 standard; protein; 425 AA. 
XX 

AC AAB67489; 
XX 

DT 29-MAY-2001 (first entry) 
XX 

DE Amino acid sequence of a human hypocretin (orexiri) receptor 1. 
XX 



KW Human; hypocretin receptor 1; orexin receptor 1; HCRTRl; chromosome. 1; 

KW lp33; central nervous system modulator. 

XX 

OS Homo sapiens. 
XX 

PN WO200114555~Al. 
XX 

PD Ol-MAR-2001. 
XX 

PF 22-AUG-2000; 2.000WO-US022986 . 
XX 

PR 23-AUG-1999; 99US-00379083 . 

PR 07-JAN-2000; 2000US-00479128 . 
XX 

PA (DECO-) DECODE GENETICS EHF. 
XX 

PI Olafsdottir BR, Gulcher J; 
XX 

DR WPI; 2001-211306/21. 

DR N-PSDB; AAF55159. 
XX 

PT Novel isolated nucleic acid molecule encoding hypocretin (orexin) 

PT receptor 1 useful for treating and diagnosing narcolepsy. 

XX 

PS Disclosure; Fig lA; 44pp; English. 
XX 

CC The present sequence represents a human hypocretin (orexin) receptor 1 

CC (HCRTRl) polypeptide. The HCRTRl gene is present on chromosome 1, 

CC location lp33. It is likely that a mutation in the HCRTRl gene is 

CC associated with narcolepsy. HCRTRl is a central nervous system modulator. 

CC The HCRTRl polypeptide and polynucleotide are useful for diagnosing or 

CC treating narcolepsy in an individual. The HCRTRl polynucleotide is a 

CC source of probes and primers, and is also used to produce the protein 

CC recombinantly 

XX 

SQ Sequence 425 AA; 

Query Match 100.0%; Score 2218; DB 4; Length 425; 
Best Local Similarity 99.8%; Pred. No. 1.4e-221; 

Matches 424; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 12 0 

I I I I I I I I I 11 I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I M I M I I I I I I I I I I 

Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I M I 

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 



Qy 

Db 



181 
181 



AVMECSSVLPELANRTRLFSVCDERWADDLYPKI YHSCFFIVTYLAPLGLMAMAYFQI FR 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M M I 
AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 24 0 



Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I M I I M I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAWACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I 1 I I I I I I II I II 
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 



Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I II I I I II I II I i I I I II I I I I II I I II I I I I I I I I I II I I I I I I II : I I I II I II I II I 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 

Qy 4'21 TTVLP 425 

I I I I I 

Db 421 TTVLP 425 



RESULT 3 
AAU11188 

ID AAU11188 standard; protein; 425 AA. 
XX 

AC AAU11188; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human G protein-coupled receptor HFG7\N72X variant. 
XX 

KW Human; G protein-coupled receptor; GPCR; HFGAN72X; bacterial infection; 

KW fungal infection; protozoan infection; viral infection; 

KW human immunodeficiency virus; HIV; cancer; diabetes; Parkinson's disease; 

KW osteoporosis; myocardial infarction; ulcer; asthma; allergy; 

KW angina pectoris; renal disease; depression; schizophrenia; anorexia; 

KW obesity; Kallman ' s syndrome; hypothalamic disorder; 

KW idiopathic hormone deficiency; gigantism; migraine; pain; lung disease; 

KW burn; sleep disorder; jet lag; Huntington's disease; gene therapy. 

XX 

OS . Homo sapiens . 
XX 

PN US2001025031-A1 . 
XX 

PD 27-SEP-2001. 
XX 

PF 06--APR-2001; 2001US-00828538 . 
XX 

PR 08-JUN-1998; 98US-0088524P . 

PR 22-JUL-1998; 98US-0093726P. 

PR 08-JUN-1999; 99US-00328014 . 
XX 

PA (ELLI/) ELLIS. C E. 

PA (KWOK/) KWOK C. 

PA (BODS/) BODSWORTH N J. 

PA (HALS/) HALSEY W. 

PA (HORN/) HORN S V. 

XX 

PI Ellis CE, Kwok C, Bodsworth NJ, Halsey W, Horn SV; 
XX 

DR WPI; 2001-624968/72. 



DR N-PSDB; AAS17464. 
XX 

PT Isolated HFGAN72 receptor useful for treatment of a patient having need 

FT of HFGAN72 receptor and in the detection and treatment of disease, e.g. 

PT infections such as bacterial, fungal, protozoan and viral infections and 

PT cancers . 
XX 

PS Claim 23; Fig 6; 75pp; English. 
XX 

CC The invention relates to an isolated polypeptide, the HFGAN72 receptor or 

CC its variant, encoded by the 8 exon sequences given in the specification. 

CC HFGAN72 is a G protein-coupled receptor (GPCR) . HFGAN72 is useful for ^the 

CC treatment of a patient having need of HFGAN72 receptor where HFGAN72 is 

CC administered by providing to the patient DNA encoding HFGAN72 and 

CC expressing HFGAN72 in vivo (i.e by gene therapy). HFGAN72 is particularly 

CC useful for applications in the detection and treatment of disease, e.g. 

CC infections such as bacterial, fungal, protozoan and viral infections, 

CC particularly infections caused by human immunodeficiency virus (HIV) -1 or 

CC HIV-2, cancers, diabetes, Parkinson's disease, osteoporosis, myocardial 

• CC infarction, ulcers, asthma, allergies, angina pectoris, renal disease, 

CC depression, schizophrenia, anorexia, obesity, Kallman*s syndrome, 

CC hypothalamic disorders, idiopathic hormone deficiency (e.g. gigantism), 

CC migraine, pain, lung diseases, burns, sleep disorders, jet lag, 

CC Huntington's disease and many other diseases and disorders given in the 

CC specification. The present sequence is the human HFGAN72X variant, 

CC encoded by an alternative allele of HFGAN72 
XX 

SQ Sequence 425 AA; 



Query Match 99.8%; Score 2215; DB 4; Length 425; 

Best Local Similarity 99.8%; Pred. No. 2.8e-221; 

Matches 424; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARR7VRGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AWECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 



Qy 

Db 

Qy 

Db 



361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I I I I I 11 I I I I I I M I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I 

361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

421 TTVLP 425 

I I I I I 
421 TTVLP 425 



ADL22443; 

20-MAY-2004 (first entry) 

Human orexin 1 receptor variant protein. 

polydipsia; single nucleotide polymorphism; SNP; orexin 1 receptor gene; 
schizophrenia; human; mutant; mute in. 

Homo sapiens. 
Synthetic. 

Key Location/ Qualifiers 

Misc-dif ference 408 

/note= "The wild-type residue of He is substituted with 
Val at this position following a single nucleotide 
polymorphism in the encoding gene" 

JP2004041055-A. 
12-FEB-2004. 

lO-JUL-2002; 2002 JP-00201575 . 

lO-JUL-2002; 2002 JP-00201575 . 

(RIKA ) RIKAGAKU KENKYUSHO. 

WPI; 2004-208085/20. 
N-PSDB; ADL22427. 

Estimating whether subject has factor of polydipsia^ comprises 
determining single nucleotide polymorphism in orexin 1 receptor gene 
and/or at least one polymorphism in linkage disequilibrium. 

Claim 11; Page; 31pp; Japanese. 

The invention relates to a novel method for estimating whether a subject 
has a factor of polydipsia. The method comprises determining a single 
nucleotide polymorphism (SNP) at position 1222 of a fully defined orexin 
1 receptor gene sequence of 1411 nucleotides^ as given in the 
specification, and/or at least one polymorphism in the linkage 
disequilibrium from a biological sample obtained from a subject. A 
polynucleotide of at least 10 contiguous bases comprising the SNP at 



CC position 1222 is useful for estimating whether a subject comprises a 

CC factor of polydipsia. A polypeptide having a polymorphic variation in the 

CC human orexin 1 receptor or its fragment^ or a transformed cell which 

CC expresses the polypeptide is useful for the screening of a compound that 

CC controls the function of the human orexin 1 receptor. The method allows 

CC detection of polydipsia, which is a serious symptom of schizophrenia and 

CC therefore useful in the selection of a treatment for preventing the 

CC symptom. This sequence represents the variant protein following the SNP 

CC at position 1222 of the 1411 nt human orexin 1 receptor gene of the 

CC invention. Note: This sequence is not shown in the specification. It has 

CC been created from the protein of SEQ ID No 2 and information provided in 

CC claim 11 of the specification. 

XX 

SQ Sequence 425 AA; 

Query Match 99.8%; Score 2215; DB 8; Length 425; 

Best Local Similarity 99.8%; Pred. No. 2.8e-221; 

Matches 424; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60 

LVGNTLVCIAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I 
LVGNTLVCLAVWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 12 0 

VT PYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGS ILGIWAVSLAIMVPQA 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 

VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I 
AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I M I I I I I I I I I I I I I I I i 
MWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN S AAN PIIYNF 360 

LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I 
LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 



I I I I I 



wy 


1 

JL 


uD 


1 
J. 






Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 



RESULT 5 
AAW80456 

ID AAW80456 standard; protein; 425 AA. 
XX 

AC AAW80456; 



XX 

DT 26-JAN-1999 (first entry) 
XX 

DE G-protein coupled receptor (HFGAN72X) polypeptide. 
XX 

KW G-protein coupled receptor; HFGAN72X; HIV infection; anorexia; cancer; 

KW bulimia; asthma; Parkinson's disease; acute heart failure; 

KW urinary retention; osteoporosis; angina pectoris; myocardial infarction; 

KW benign prostatic hypertrophy; neurological disorder. 

XX 

OS Homo sapiens. 
XX 

PN EP875566-A2. 
XX 

PD 04-NOV-1998. 

XX 

PF 27-OCT-1997; 97EP-00308563 . 
XX 

PR 30-APR-1997; 97US-00846704 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis CE; 
XX 

DR WPI; 1998-559432/48. 

DR N-PSDB; AAV63468. 
XX 

PT New human G-protein coupled receptor HFGAN72X polypeptide and 

PT polynucleotide - useful as diagnostic reagents and for treating e.g. HIV 

PT infection, cancer and Parkinson's disease. 

XX 

PS Claim 1; Page 7-8; 24pp; English. 
XX 

CC The present sequence represents a G-protein coupled receptor (HFGAN72X) 

CC polypeptide. HFGAN72X polypeptides and polynucleotides are useful for 

CC diagnosing diseases related to over or under expression of HFGAN72X 

CC proteins by identifying mutations in the HFGAN72X gene using HFGAN72X 

CC probes, or determining HFGAN72X protein or mRNA expression levels. 

CC HFGAN72X polypeptides are also useful for screening for compounds which 

CC affect activity of the protein. Diseases that can be treated with 

CC HFGAN72X include HIV infections, pain, anorexia, cancers, bulimia, 

CC asthma, Parkinson's disease, acute heart failure, hypotension, 

CC hypertension, urinary retention, osteoporosis, angina pectoris, 

CC myocardial infarction, ulcers, allergies, benign prostatic hypertrophy, 

CC and psychotic and neurological disorders 

XX 

SQ Sequence 425 AA; 

Query Match 99.8%; Score 2214; DB 2; Length 425; 

Best Local Similarity 99.5%; Pred. No. 3.6e-221; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I i I I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 
I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 

121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

181 AVMECSSVXPELANRTRLFSVCDERWADDLYPKIYHSCFFIWYIAPLGLMAm 240 

I I I I I I I I I I I I I i I I I I I I I I I I I I I I I i I I I I I I I I I I I li I I I I I I I I I I I I M I M 
181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAK^ 300 

301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I M M I I I I I 
361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 

421 TTVLP 425 

I I I I I 
421 TTVLP 425 



RESULT 6 
AAU11186 

ID AAU11186 standard; protein; 425 AA. 

XX 

AC AAU11186; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human G protein-coupled receptor HFGAN72X. 
XX 

KW Human; G protein-coupled receptor; GPCR; HFGAN72X; bacterial infection; 

KW fungal infection; protozoan infection; viral infection; 

KW human immunodeficiency virus; HIV; cancer; diabetes; Parkinson's disease; 

KW osteoporosis; myocardial infarction; ulcer; asthma; allergy; 

KW angina pectoris; renal disease; depression; schizophrenia; anorexia; 

KW obesity; Kallman's syndrome; hypothalamic disorder; 

KW idiopathic hormone deficiency; gigantism; migraine; pain; lung disease; 

KW burn; sleep disorder; jet lag; Huntington's disease; gene therapy. 
XX 

OS Homo sapiens. 
XX 

PN US2001025031-A1. 
XX 

PD 27-SEP-2001. 

XX 

PF 06-APR-2001; 2001US-00828538 . 
XX 

PR 08-JUN-1998; 98US-0088524P . 

PR 22-JUL-1998; 98US-0093726P. 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



PR 08-JUN-1999; 99US-00328014 . 
XX 

PA (ELLI/) ELLIS C E. 

PA (KWOK/) KWOK C. 

PA (BODS/) BODSWORTH N J. 

PA (HALS/) HALSEY W. 

PA (HORN/) HORN S V. 

XX 

PI Ellis CE, Kwok Bodsworth NJ, Halsey W, Horn SV; 

XX 

DR WPI; 2001-624968/72. 
XX 

PT Isolated HFGAN72 receptor useful for treatment of a patient having need 

PT of HFGAN72 receptor and in the detection and treatment of disease, e.g. 

PT infections such as bacterial, fungal, protozoan and viral infections and 

PT cancers . 
XX 

PS Claim 8; Fig 2; 75pp; English. 

XX 

CC The invention relates to an isolated polypeptide, the HFGAN72 receptor or 

CC its variant, encoded by the 8 exon sequences given in the specification. 

CC HFGAN72 is a G protein-coupled receptor (GPCR) . HFGAN72 is useful for the 

CC treatment of a patient having need of HFGAN72 receptor where HFGAN72 is 

CC administered by providing to the patient DNA encoding HFGAN72 and 

CC expressing HFGAN72 in vivo (i.e by gene therapy). HFGAN72 is particularly 

CC useful for applications in the detection and treatment of disease, e.g. 

CC infections such as bacterial, fungal, protozoan and viral infections, 

CC particularly infections caused by human immunodeficiency virus (HIV) -1 or 

CC HIV-2, cancers, diabetes, Parkinson's disease, osteoporosis, myocardial 

CC infarction, ulcers, asthma, allergies, angina pectoris, renal disease, 

CC depression, schizophrenia, anorexia, obesity, Kallman's syndrome, 

CC hypothalamic disorders, idiopathic hormone deficiency (e.g. gigantism), 

CC migraine, pain, lung diseases, burns, sleep disorders, jet lag, 

CC Huntington's disease and many other diseases and disorders given in the 

CC specification. The present sequence is the human HFGAN72X receptor being 

CC the product of a splice variant of HFGAN72 

XX 

SQ Sequence 425 AA; 

Query Match ^ 99.8%; Score 2214; DB 4; Length 425; 

Best Local Similarity 99.5%; Pred. No. 3.6e-221; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

LVGNTLVCIAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 
I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I i I I I I I I 



I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Qy 


1 


Db 


1 


QY 


61 


Db 


61 


Qy 


121 


Db 


121 



Qy 



181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I 



Db 



181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 



Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I i I I I I I I I 
Db 241 KLWGRQIPGTTSALVR^^ArKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVL^fVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I 

Db 301 MVVLLVFALCYLPISVLWLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I I I I I I M I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I * I I I I I I I I I I I I 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 

Qy 421 TTVLP 425 

I I I I I 

Db 421 TTVLP 425 



RESULT 7 


AAB67079 


ID 


AAB67079 standard; protein; 425 AA. 


XX 




AC 


AAB67079; 


XX 




DT 


lO-APR-2001 (first entry) 


XX 




DE 


Human HFGAN72 receptor SEQ ID NO: 13. 


XX 




KW 


Human; mouse; rat; Lig72A; Lig72B; neuropeptide receptor; HFGAN72; 


KW 


truncation mutant; ligand; neurodegenerative disorder; pain; 


KW 


eating disorder; behaviour disorder; mood disorder. 


XX 




OS 


Homo sapiens. 


XX 




PN 


WO200100787-A2. 


XX 




PD 


04-JAN-2001. 


XX 




PF 


22-JUN-2000; 2000WO-US017251 . 


XX 




PR 


25-JUN-1999; 99US-0141156P . 


XX 




PA 


(SMIK ) SMITHKLINE BEECHAM CORP. 


PA 


(SMIK ) SMITHKLINE BEECHAM PLC. 


XX 




PI 


Bingham S, Darker J, Liu Martin JD, Parsons AA, Patel SR; 


XX 




DR 


WPI; 2001-071483/08. 


XX 




PT 


Polynucleotides encoding Lig 72A polypeptides or their variants, which 


PT 


are useful in the treatment of a disease or disorder associated with 


PT 


pain, e.g. enhanced or exaggerated sensitivity to pain, hyperalgesia. 


PT 


neuropathic pain and back pain. 


XX 




PS 


Claim 8; Fig 7; lOlpp; English. 


XX 





CC The present invention provides the protein and coding sequences for the 

CC human, mouse and rat HFGTM^ receptor ligand Lig72A. It also provides 

CC truncated mutant versions. These, and their agonists and antagonists, are 

CC all useful in the treatment of eating, neurodegenerative, behaviour, 

CC mood, sexual, hormonal and sleep disorders, pain, depression, epilepsy 

CC and acute inflammatory conditions 



XX 

SQ Sequence 425 AA; 

Query Match 99.8%; Score 2214; DB 4; Length 425; 

Best Local Similarity 99.5%; Pred. No. 3.6e-221; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



yy 


1 
J. 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 


60 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 LI 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 M 




Db 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


yy 


\j J. 


LVGNTLVCLAWRNHHMRT WN YFI WLS lADVLWAI CL PAS LLVDI TE SWLFGHALCK 


120 




M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 i 1 i 1 M 




Db 


61 


LVGNTLVCIAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 


Ov 


121 


VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 




1 Mill II Mill II 1 II M 1 1 1 II II M M II M II 1 II II II II 1 




Db 


121 


VI P YLQAVS VSVAVLTLS FI ALDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 


180 


Qy 


181 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMA^1AYFQIFR 


240 




II II II 1 1 II 1 1 M 1 II 1 1 1 II 1 II II II II 1 II 1 II 1 II 1 1 II II 1 II M II 1 1 II II 1 




Db 


181 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYIAPLGLMAMAYFQIFR 


240 


Qy 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFIAEVKQMRARRKTAKML 


300 




1 II II II II II II 1 II 1 1 II II II II 1 II 1 1 II 1 1 II 1 1 1 II 1 II 1 1 II II II 1 II 1 II 




Db 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 


Qy 


301 


MWLLVFALC YL P I SVLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN SAAN P 1 1 YNF 


360 




M 1 II 1 II II 1 1 1 1 II II II II 1 1 1 II 1 II 1 1 II 11 II II 1 1 1 1 II 1 1 II 1 II 1 1 II 1 II 




Db 


301 


MWLLVFALCYLPI SVLNVLKRVFGMFRQAS DREAVYACFTFSHWLVYANSAANPI I YNF 


360 


Qy 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 


420 




II 1 II 1 II II 1 M 1 II 1 II 1 II II II II II 1 II II 1 II M II II II 1 : II II II II 1 II 1 




Db 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 


420 


Qy 


421 


TTVLP 425 




Db 


421 


II 1 1 1 

TTVLP 425 





RESULT 8 

AAG78345 

ID AAG78345 standard; protein; 425 AA. 
XX 

AC AAG78345; 
XX 

DT 22-JAN-2002 (first entry) 
XX 

DE Human HFGAN72X G coupled receptor polypeptide. 
XX 

KW Antibacterial; fungicide; virucide; protozoacide ; anti-HIV; analgesic; 



KW cytostatic; nootropic; antiparkinsonian; cardiant; antiulcer; 

KW antiasthmatic; tranquiliser ; neuroleptic; antidepressant; anticonvulsant; 

KW osteopathic; HIV infection; pain; cancer; anorexia; bulimia; 

KW Parkinson's disease; acute heart failure; hypotension; hypertension; 

KW urinary retention; osteoporosis; angina pectoris; myocardial infarction; 

KW ulcers; asthma; allergy; deliritim; dementia; 

KW benign prostatic hypertrophy; anxiety; schizophrenia; manic depression; 

KW dyskinesia; G coupled receptor; HFG7^72X; 7 transmembrane receptor, 
XX 

OS Homo sapiens . 
XX 

PN EP1154019-A2. 
XX 

PD 14-NOV-2001. 
XX 

PF 27-OCT-1997; 2001EP-00203008 . 
XX 

PR 30-APR-1997; 97US-00846704 . 

PR 27-OCT-1997; 97EP-00308563 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
. XX 

PI Bergsma DJ, Ellis CE; 
XX 

DR WPI; 2002-012659/02. 

DR N-PSDB; AAI64172. 

XX 

PT Nucleic acid encoding the HFGAN72X receptor, useful for diagnosis and 

PT treatment of e.g. infections^ cancer, anorexia, bulimia, Parkinson's 

PT disease, and acute heart failure. 
XX 

PS Claim 11; Page 7-8; 24pp; English. 
XX 

CC The present sequence is that of a human HFGAN72X polypeptide encoded by a 

CC cDNA shown in AAI64172. The specification describes a newly isolated 

CC polynucleotide encoding a HFGAN72X G coupled receptor polypeptide. The 

CC protein of the invention has antibacterial, fungicide, virucide, 

CC protozoacide, anti-HIV, cardiant, analgesic, cytostatic, nootropic, 

CC antiparkinsonian, antiulcer, antiasthmatic, tranquiliser, neuroleptic, 

CC antidepressant, anticonvulsant and osteopathic activities. HFGAN72X 

CC polynucleotides (PNs) are used to express HFGAN72X in vivo, to treat 

CC diseases requiring increased activity or expression of HFG7\N72X; for 

CC recombinant production of HFGAN72X; diagnose diseases by detecting 

CC mutations in genomic sequences and in chromosome identification and 

CC mapping. HFGAN72X polypeptides are used to raise specific antibodies; as 

CC therapeutic agents; to identify HFGAN72X protein-expressing clones; to 

CC purify HFGAN72X proteins; in vaccines. Cells transformed with HFGAN72X 

CC PNs are used to identify {ant ) agonists of HFGAN72X, useful 

CC therapeutically. Nucleic acids that inhibit expression of HFGAN72X and 

CC polypeptides that compete with ligands for binding to HFGAN72X proteins 

CC are also useful therapeutically and diagnostically . HFGAN72X-related 

CC diseases include infections (bacterial, viral, fungal or protozoal, 

CC particularly HIV-1 or -2); pain; cancer; anorexia; bulimia; Parkinson's 

CC disease; acute heart failure; hypotension; hypertension; urinary 

CC retention; osteoporosis; angina pectoris; myocardial infarction; ulcers; 

CC asthma; allergy; benign prostatic hypertrophy; anxiety; schizophrenia; 

CC manic depression; delirium; dementia; severe mental retardation .and 



CC dyskinesias 
XX 

SQ Sequence 425 AA; 

Query Match 99.8%; Score 2214; DB 5; Length 425; 

Best Local Similarity 99.5%; Pred. No. 3,6e-221; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 






1 1 1 1 1 t 1 t 1 1 1 1 1 1 It 1 1 1 1 M t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 It 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


Qy 


61 


LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 






1 1 1 1 1 i 1 t 1 1 i 1 t 1 1 t 1 1 1 i t 1 t 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 i II 1 1 1 t 11 1 1 1 1 1 1 1 i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 i 1 1 1 1 i 1 1 1 1 




Db 


61 


LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 


Qy 


121 


VIPYLQAVSVSVAVLTLSFIALDRWY7VICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 






t 1 1 1 1 1 t 1 1 1 1 1 t t 1 1 1 1 1 t 1 1 1 t 1 1 t 1 II t II t 1 1 t 1 I 1 t 1 t t f 1 t t i It II 1 1 1 II II 
1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 < 




Db 


121 


VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRT^GSILGIWAVSLAIMVPQA 


180 


Qy 


181 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLM75jy[AYFQIFR 


240 






1 1 t 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 i 1 1 t 1 1 11 1 1 i II 1 1 II II 11 1 1 1 1 II 
1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 < 1 1 1 < 1 I I 1 i 1 1 




Db 


181 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


Qy 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 


300 






1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 


Qy 


301 


MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 


360 






1 1 1 II 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 II 1 1 1 1 1 II 1 II II 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 




Db 


301 


MWLLVFALCYLPI SVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPII YNF 


360 


Qy 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 


420 


Qy 


421 


TTVLP 425 




Db 


421 


1 1 1 1 1 

TTVLP 425 





RESULT 9 
ABP81941 



ID ABP81941 standard; protein; 425 AA. 

XX 

AC ABP81941; 
XX 

DT 04-MAR-2003 (first entry) 

XX 

DE Human orexin receptor 1 protein SEQ ID NO: 368. 
XX 

KW G protein-coupled receptor; GPCR; antigenic peptide; gene therapy; 

KW G protein-coupled receptor modulator; antibody; immune-related disease; 

KW growth-related disease; cell regeneration-related disease; AIDS; cancer; 

KW immunological-related cell proliferative disease; autoimmune disease; 

KW Alzheimer's disease; atherosclerosis; infection; osteoarthritis; allergy; 

KW osteoporosis; cardiomyopathy; inflammation; Crohn's disease; diabetes; 



KW graft versus host disease; Parkinson *s disease; multiple sclerosis; pain; 

KW psoriasis; anxiety; depression; schizophrenia; dementia; memory loss; 

KW mental retardation; epilepsy; asthma; tuberculosis; obesity; nausea; 

KW hypertension; hypotension; renal disorder; rheumatoid arthritis; trauma; 

KW ulcer. 

XX 

OS Homo sapiens. 
XX 

PN WO200261087-A2. 
XX 

PD 08-AUG-2002. 
XX 

PF 19-DEC-2001; 2001WO-US050107 . 
XX 

PR 19-DEC-2000; 2000US-0257144P. 

XX 

PA (LIFE-) LIFESPAN BIOSCIENCES INC. 
XX 

PI Burmer GC, Roush CL, Brown JP; 
XX 

DR WPI; 2003-046718/04. 

DR ,N-PSDB; ABZ42789. 
XX 

PT New isolated antigenic peptides e.g., for G protein-coupled receptors 

PT (GPCR) , useful for diagnosing and designing drugs for treating conditions 

PT in which GPCRs are involved, e.g. AIDS, Alzheimer's disease, cancer or 

PT autoimmune diseases. 

XX 

PS Disclosure; Fig 1; 523pp; English, 
XX 

CO The present invention describes antigenic peptides (I) comprising: (a) 

CC any one of 1601 sequences (see ABP82019 to ABP83619) of 12-24 amino 

CC acids. Also described: (1) an assay for the detection of a particular G 

CC protein-coupled receptor (GPCR) or a candidate polypeptide in a sample; 

CC and (2) an isolated antibody having high specificity and high affinity or 

CC avidity for a particular GPCR. (I) can be used as GPCR modulators and in 

CC gene therapy. The antigenic peptides for GPCRs are useful in detecting an 

CC antibody against a particular GPCR, and in the production of specific 

CC antibodies. The peptides and antibodies are also useful for detecting the 

CC presence or absence of corresponding GPCRs. The antigenic peptides for 

CC GPCRs and antibodies are useful for diagnosing and designing drugs for 

. CC treating immune- related diseases, growth-related diseases, cell 

CC regeneration-related disease, immunological-related cell proliferative 

CC diseases, or autoimmune diseases, e.g. AIDS, Alzheimer's disease, 

CC atherosclerosis, bacterial, fungal, protozoan or viral infections, 

CC osteoarthritis, osteoporosis, cancer, cardiomyopathy, chronic and acute 

CC inflammation, allergies, Crohn's disease, diabetes, graft versus host 

CC disease, Parkinson's disease, multiple sclerosis, pain, psoriasis, 

CC anxiety, depression, schizophrenia, dementia, mental retardation, memory 

CC loss, epilepsy, asthma, tuberculosis, obesity, nausea, hypertension, 

CC hypotension, renal disorders, rheumatoid arthritis, trauma, ulcers, or 

CC any other disorder in which GPCRs are involved. The antibodies may be 

CC used in immunoassays and immunodiagnosis . ABZ42523 to ABZ42869 encode 

CC GPCR proteins given in ABP81675 to ABP82018, which are used in the 

CC exemplification of the present invention 
XX 

SQ Sequence 425 AA; 



Query Match 99.8%; Score 2214; DB 6; Length 425; 

Best Local Similarity 99.5%; Pred. No. 3.6e-221; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 



0; 



Ov 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 i 1 




Db 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


yy 


61 


LVGNT LVCLAVWRNHHMRTVTN YFI VNLS LADVLVT AI C L PAS LLVDI T ESW L FGHAL C K 


120 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


LVGNTLVCLAVWRNHHMRTVTNYFIVNLSL7VDVLVTAICLPASLLVDITESWLFGHALCK 


120 


wy 


121 


VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 




llllllllllllllllllll Illlllllllllllllllllllllllllllllll 




Db 


121 


VT P YLQAVSVS VAVLTLS FX ALDRWYAI CHPLLFKSTARRARGS I LGI WAYS LAIMVPQA 


180 


wy 


1 ftl 

X O X 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 




1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 N 1 M 1 1 




Db 


181 


AVMECSSV1.PELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


Ov 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 


300 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllllllllllllllll 




Db 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 


Qy 


301 


MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 


360 




1 1 1 1 1 1 1 11 1 1 1 1 1 1 II 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 


360 


Qy 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 


420 




1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 II : 1 II 1 1 1 1 1 1 1 II 




Db 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 


420 


Qy 


421 


TTVLP 425 




Db 


421 


1 1 1 1 1 

TTVLP 425 





RESULT 10 


ABG75058 


ID 


ABG75058 standard; protein; 425 AA. 


XX 




AC 


ABG75058; 


XX 




DT 


12-FEB-2004 (first entry) 


XX 




DE 


Human orexin receptor 1 protein. 


XX 




KW 


Energy homeostasis; mouse; metabolism; triglyceride; metabolic disease; 


KW 


gene therapy; anorectic; immunomodulator; antidiabetic; hypotensive; 


KW 


cardiant; osteopathic; antilipemic* 


XX 




OS 


Homo sapiens. 


XX 




PN. 


WO2003075945-A2. 


XX 




PD 


18-SEP-2003. 



XX 

PF 14-MAR-2003; 2003WO-EP002714 , 

XX 

PR 14-M7VR-2002; 2002EP-00005882 , 

PR 15-MAR-2002; 2002EP-00006012 . 

PR 20-MAR-2002; 2002EP-00006271 . 

PR 25-MAR-2002; 2002EP-00006810 . 
XX 

PA (DEVE-) DEVELOGEN ENTWICKLUNGSBIOLOGISCHE FORSCH. 
XX 

PI Eulenberg K, Steuernagel A, Haeder T, Broenner G; 
XX 

DR WPI; 2003-748334/70. 

DR N-PSDB; ACH00818. 
XX 

PT New pharmaceutical composition comprising a nucleic acid molecule 

PT encoding proteins regulating the energy homeostasis and metabolism of 

PT triglycerides useful for detecting or preventing metabolic diseases, e.g. 

PT obesity. 

XX 

PS Claim 3; Fig 7G; 140pp; English. 
XX 

CC The present invention relates to pharmaceutical compositions comprising 

CC the coding sequences shown in ACH00815-ACH00827 , or their encoded 

CC proteins {shown in ABG75054, ABG75056-ABG75067 ) . These are proteins 

CC involved in the metabolism of triglycerides and in energy homeostasis, 

CC and their coding sequences. The composition is useful for the manufacture 

CC of an agent for detecting, verifying, treating, alleviating or preventing 

CC disorders, including metabolic diseases such as obesity and other body- 

CC weight regulation disorders as well as related disorders such as 

CC metabolic syndrome, eating disorder, cachexia, diabetes mellitus, 

CC hypertension, coronary .heart disease, hypercholesterolaemia, 

CC dyslipidaemia, osteoarthritis or gallstones, in cells, cell masses, 

CC organs or subjects. The coding sequences can be used in the production of 

CC transgenic animals which under- or over-produce the gene of interest. The 

CC present sequence is a protein of the invention 

XX 

SQ Sequence 425 AA; 

Query Match 99.8%; Score 2214; DB 7; Length 425; 
Best Local Similarity 99.5%; Pred. No. 3.6e-221; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 




Db 



1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 



Qy 



61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 




Db 



61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 




Db 



121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 



Qy 



181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 
i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 



Db 



181 AVMECSSVLPELMIRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGLM^^ 240 



Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 



I I I I I I I I I I I I i I I I I M I I I I I M I I I I M I I I I I I I I 1 I I I I I I I I I I I I I I I M I 

KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 
IWVLLVFTU^CYLP I S VLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAANP 1 1 YNF 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN S AAN P II YN F 
LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I ^1 I I I M I M I I I 

LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 

TTVLP 425 
I I I I I 



RESULT 11 
ADK52564 

ID ADK52564 standard; protein; 425 AA. 

XX 

AC ADK52564; 
XX 

DT 06-MAY-2004 (first entry) 
XX 

DE Hematological disorder associated Gene ID 14393 encoded protein. 
XX 

KW cytostatic; antianemic; antisickling; virucide; hemostatic; nephrotropic; 

KW cytostatic; thrombolytic; antiparasitic; gene therapy; 

KW hematologic disorder; cancer; Sickle Cell Anemia; 

KW Infectious Mononucleosis; Leukemia; Polycythemia Vera; Lymphoma; 

KW Retinoblastoma; Hemophilia; Thrombosis; Herpes; Thalassemia; 

KW transfusion reaction; Erythroblastosis; mechanical trauma; 

KW micro-angiopathic hemolytic anemia; parasite infection. 

XX 

OS Homo sapiens. 
XX 

PN WO2003065871-A2. 
XX 

PD 14-AUG-2003. 
XX 

PF 28-JAN-2003; 2003WO-US002484 . 
XX 

PR 04-FEB-2002; 2002US-0354333P . 

PR 28-FEB-2002; 2002US-0360258P . 

PR 15-MAR-2002; 2002US-0364476P . 

PR 26-APR-2002; 2002US-0375626P . 

PR 06-JUN-2002; 2002US-0386494P . 

PR 24'JUN-2002; 2002US-0390965P . 

PR 28-JUN-2002; 2002US-0392480P . 

PR 03-JUL-2002; 2002US-0394128P . 

PR 31-JUL-2002; 2002US-0399783P . 

PR 13-AUG-2002; 2002US-0403221P . 

PR 30-AUG~2002; 2002US-04 07045P . 

PR 25-NOV-2002; 2002US-0429048P . 



XX 

PA (MILL-) MILLENNIUM PHAPM INC. 
XX 

PI Carroll JM, Healy A, Welch NS, Kelly LM; 
XX 

DR WPI; 2003-731464/69. 

DR N-PSDB; ADK52563. 
XX 

PT Identifying a compound capable of treating a hematologic disorder (e.g. 

PT anemia or leukemia) comprises assaying the ability of the compound to 

PT modulate the expression or activity of e.g. 131,148, 199 or 12303 

PT polypeptide or nucleic acid. 
XX 

PS Disclosure; SEQ ID NO 22; 232pp; English. 
XX 

CC The invention relates to a method of identifying a compound capable of 

CC treating a hematologic disorder comprises assaying the ability of the 

CC compound to modulate 131,148, 199, 12303, 13906, 15513, 17822, 302, 5677, 

CC 194, 14393, 28059, 7366, 12212, 1981, 261, 12416, 270, 1410, 137, 1871, 

CC 13051, 1847, 1849, 15402, 340, 10217, 837, 1761, 8990 or 13249 nucleic 

CC acid expression or polypeptide activity, thus, identifying a compound 

CC capable of treating a hematologic disorder. The methods are useful in 

CC diagnosing, preventing and treating hematological disorders, such as 

CC cancer. Sickle Cell Anemia, Infectious Mononucleosis, Leukemia, 

CC Polycythemia Vera, Lymphoma, Retinoblastoma, Hemophilia, disorders 

CC associated with an increased risk of Thrombosis, Herpes, Thalassemia, 

CC antibody-mediated disorders such as transfusion reactions and 

CC Erythroblastosis, mechanical trauma to red blood cells such as micro- 

CC angiopathic hemolytic anemias, infections by parasites or chemical 

CC injuries. The methods may also be used for identifying compounds that 

CC modulate hematological disorders. This sequence corresponds to the 

CC protein encoded by one of the genes modulated by the compounds. 

XX 

SQ Sequence 425 AA; 

Query Match 99.8%; Score 2214; DB 7; Length 425; 

' Best Local Similarity 99.5%; Pred. No. 3.6e-221; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVEWA 60 

I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I n M 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

LVGNTLVCLAWRNHHMRTWNYFIWLSL7U)VLVTAICLPASLLVDITESWLFGHALCK 120 

I i I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 
I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
AVMECSSVLPELANRTRLFSVCDERWADDLYPKI YHSCFFIVTYLAPLGLMAMAYFQI FR 2 4 0 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 



Qy 



241 



KLWGRQIPGTTS7y:,VRNWKRPSDQLGDLEQGLSGEPQPRABAFLAEVKQMRARRKTAKML 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 



Qy 



301 MVVLLVFALCYLPISVLNVLKRWGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 



.301 MWLLVFALCYLPISVL^r\/LKRVFGMFRQASDREAWACFTFSHWLVYANSAANPIIYNF 360 




Db 



Qy 



361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 




Db 



361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 



421 TTVLP 425 



Db 



421 TTVLP 425 



RESULT 12 
ADL22428 

ID ADL22428 standard; protein; 425 AA. 
XX 

AC ADL22428; 
XX 

DT 20-MAY-2004 (first entry) 
XX 

DE Human orexin 1 receptor protein. 
XX 

KW polydipsia; single nucleotide polymorphism; SNP; orexin 1 receptor gene; 

KW schizophrenia; human. 

XX 

OS Homo sapiens. 
XX 

PN JP2004041055-A. 
XX 

PD 12-FEB-2004. 
XX 

PF lO-JUL-2002; 2002 JP-00201575 . 
XX 

PR lO-JUL-2002; 2002 JP-00201575 . 
XX 

PA (RIKA ) RIKAGAKU KENKYUSHO. 
XX 

DR WPI; 2004-208085/20. 

DR N-PSDB; ADL22427. 
XX 

PT Estimating whether subject has factor of polydipsia, comprises 

PT determining single nucleotide polymorphism in orexin 1 receptor gene 

PT and/or at least one polymorphism in linkage disequilibrium. 

XX 

PS Claim 3; SEQ ID NO 2; 31pp; Japanese, 
XX 

CC The invention relates to a novel method for estimating whether a subject 

CC has a factor of polydipsia. The method comprises determining a single 

CC nucleotide polymorphism (SNP) at position 1222 of a fully defined orexin 

CC 1 receptor gene sequence of 1411 nucleotides, as given in the 

CC specification, and/or at least one polymorphism in the linkage 

CC disequilibrium from a biological sample obtained from a subject. A 

CC polynucleotide of at least 10 contiguous bases comprising the SNP at 

CC position 1222 is useful for estimating whether a subject comprises a. 



CC factor of polydipsia. A polypeptide having a polymorphic variation in the 

CC human orexin 1 receptor or its fragment, or a transformed cell which 

CC expresses the polypeptide is useful for the screening of a compound that 

CC controls the function of the human orexin 1 receptor. The method allows 

CC detection of polydipsia, which is a serious symptom of schizophrenia and 

CC therefore useful in the selection of a treatment for preventing the 

CC symptom. This sequence represents the protein of the 1411 nt human orexin 

CC 1 receptor gene of the invention. 

XX 

SQ Sequence 425 AA; 



Query Match 99.8%; Score 2214; DB 8; Length 425; 

Best Local Similarity 99.5%; Pred. No. 3.6e-221; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 



Qy 


1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 11 1 1 1 1 1 1 




Db 


1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


Qy 


61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 


1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 i 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 




Db 


61 LVGNTLVCLAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 


Qy 


12 1 VI PYLQAVSVSVAVLTLS FIALDRWYAICHPLLFKSTARRARGSI LGIWAVSLAIMVPQA 


180 


1 1 1 1 ■ 1 1 ■ 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 t 1 1 1 1 1 1 1 i 1 1 1 i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i M 1 M 1 M i 1 M 1 1 1 N M M 




Db 


121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 


Qy 


181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 M M 1 1 M 1 1 1 1 M 1 N 1 1 




Db 


181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


Qy 


241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 


300 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 KLWGRQI PGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 


Qy 


301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 


360 


1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 MNAH^LVFALCYLPI SVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPI I YNF 


360 


Qy 


361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 


420 


M 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 : 1 1 1 1 1 1 M 1 M 1 




Db 


361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 


420 


Qy 


421 TTVLP 425 




Db 


1 1 1 1 1 
421 TTVLP 425 





RESULT 13 
ADO29106 

ID ADO29106 standard; protein; 425 AA. 

XX 

AC ADO29106; 
XX 

DT 29-JUL-2004 (first entry) 
XX 

DE Human novel GPCR HCRTRl, SEQ ID NO: 205. 



XX 

KW G protein-coupled receptor; GPCR; drug screening; diagnosis; 

KW transgenic mouse; neurological disorder; adrenal gland disorder; 

KW colon disorder; intestinal disorder; cardiovascular disorder; 

KW muscular disorder; blood disorder; immune disorder; bone disorder; 

KW joint disorder; metabolic disorder; nutritive disorder; cancer; 

KW kidney disorder; liver disorder; lung disorder; breast disorder; 

KW ovary disorder; uterus disorder; prostate disorder; testis disorder; 

KW skin disorder; stomach disorder; pancreas disorder; spleen disorder; 

KW thymus disorder; thyroid disorder; antiparkinsonian; antimanic; 

KW cytostatic; antiinflammatory; vasotropic; antianginal; antiarrhythmic; 

KW CNS; central nervous system; respiratory; antidiarrhoeic; antidiabetic; 

KW virucide; hepatotropic; antibacterial; antianaemic; antiseborrhoeic; 

KW dermatological; antiulcer; antithyroid; antiallergic; anorectic; 

KW immunosuppressive; nephrotropic; gene therapy; GPCR modulator; human; 

KW receptor. 

XX 

OS Homo sapiens. 
XX 

FN WO2004040000-A2 . 
XX 

PD 13-MAY-2004. 
XX 

PF 09-SEP-2003; 2003WO-US028226 . 
XX 

PR 09-SEP-2002; 2002US-0409303P. 

PR 09-APR-2003; 2003US-0461329P . 
XX 

PA (PRIM-) PRIMAL INC. 
XX 

PI Gaitanaris GA^ Bergmann JE, Gragerov A, Hohmann J, Li F; 

PI Madisen L, Mcilwain KL, Pavlova MN, Vassilatis D, Zeng H; 
XX 

DR WPI; 2004-390329/36. 

DR N-PSDB; ADO29780. 
XX 

PT Novel mammalian G protein coupled receptors^ useful for identifying 

PT compounds that modulates diagnosing and treating disease condition 

PT associated with GPCR dysfunction e.g. autoimmune diseases, angina 

PT pectoris, Parkinson's disease. 
XX 

PS Claim 1; SEQ ID NO 205; 542pp; English. 
XX 

CC The invention relates to human and mouse G protein-coupled receptors 

CC (GPCRs) and nucleic acids encoding them. The invention also relates to 

CC sequences at least 90% identical to the GPCR proteins and nucleic acids 

CC of the invention; methods of treating, preventing or diagnosing diseases 

CC associated with GPCRs of the invention; methods of screening for 

CC compounds useful in the treatment of GPCR-related diseases; a transgenic 

CC mouse comprising a GPCR gene of the invention; a mouse comprising a 

CC mutation in a GPCR transgene or in an endogenous GPCR gene; cells derived 

CC from the trasngenic mice; kits comprising several mice, each of which has 

CC a mutation in a different GPCR gene of the invention; and kits comprising 

CC probes which hybridise to GPCR polynucleotides of the invention. The 

CC invention further discloses variants of the GPCR polypeptides and vectors 

CC comprising a GPCR nucleic acid. The GPCR nucleic acids and proteins may 

CC be used in the diagnosis, treatment or prevention of a wide variety of 



CC diseases including neurological disorders (e.g., Alzheimer's disease, 

CC depression, diabetic neuropathy, Parkinson's disease or schizophrenia); 

CC disorders of the adrenal gland; disorders of the colon or intestine 

CC (e.g., Crohn's disease, diarrhoea, food poisoning or irritable bowel 

CC syndrome); cardiovascular disorders (e.g., angina, cardiac arrhythmia or 

CC myocardial infarction); muscular disorders; blood disorders (e.g., 

CC anaemia or leukaemia); immune disorders (e.g., autoimmune disorders or 

CC AIDS); bone and joint disorders (e.g., osteoarthritis, rheumatoid 

CC arthritis, gout or osteoporosis); metabolic or nutritive disorders (e.g., 

CC obesity, enzyme deficiency- related diseases or vitamin deficiency-related 

CC diseases) ; and disorders of the kidney, liver, lung, breast, ovary, 

CC uterus, prostate, testis, skin, stomach, pancreas, spleen, thymus and 

CC thyroid (e.g., cancers). The present sequence represents a GPCR of the 

CC invention. Note: The full sequence data for this patent did not form part 

CC of the printed specification; those sequences not shown were obtained in 

CC electronic format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences. 

XX 

SQ Sequence 425 AA; 



Query Match 99.8%; Score 2214; DB 8; Length 425; 

Best Local Similarity 99.5%; Pred. No. 3.6e-221; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy 


1 


ME P S AT PGAQMGVP P GS RE P S PVP P D YEDEFLRYLWRD YL YP KQ YEWVL I AAYVAVFWA 


bU 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 




Db 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


Qy 


61 


LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 


Qy 


121 


VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 






1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


VI P YLQAVSVS VAVLTLS FIALDRWYAI CHPLLFKSTARRARGS I LGI WAVSLAIMVPQA 


180 


Qy 


181 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


Qy 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 


300 






1 1 M 1 U 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 


Qy 


301 


MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


MWLLVFALC YLPI SVLNVLKRVFGMFRQAS DREAVYACFTFSHWLVYANSAANP 1 1 YN F 


360 


Qy 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 


420 


Qy 


421 


TTVLP 425 




Db 


421 


1 1 1 1 1 

TTVLP 425 





RESULT 14 

ABB56378 

ID ABB56378 standard; protein; 425 AA, 
XX 

AC ABB56378; 
XX 

DT 18-FEB-2002 (first entry) 
XX 

DE Non- endogenous human GPCR protein, SEQ ID NO: 549. 

XX 

KW Human; G protein-coupled receptor; GPCR; non-endogenous; mutant; 

KW constitutively activated GPCR; agonist; disease. 

XX 

OS Homo sapiens. 

OS Synthetic. 
XX 

PN WO200177172-A2. 

XX 

PD 18-OCT-2001. 
XX 

PF 05-APR-2001; 2001WO-US011098 . 
XX 

PR 07-APR-2000; 2000US-0195747P . 
XX 

PA (AREN-) ARENA PHARM INC. 
XX 

PI Lehmann-Bruinsma K, Liaw CW, Lin I; 
XX 

DR WPI; 2001-648759/74. 

DR N-PSDB; ABI98014. 
XX 

PT Identifying agonists of G protein-coupled receptors (GPCRs) for use in 

PT disease treatment, comprises contacting candidate compounds with versions 
PT . of GPCRs. 
XX 

PS Claim 1; Page 350-351; 394pp; English. 
XX 

CC The invention relates to G protein-coupled receptors (GPCRs) for which 

CC the endogenous ligand has been identified. Non-endogenous constitutively 

CC activated versions of known GPCRs are used in the invention for the 

CC direct identification of candidate compounds as receptor agonists, 

CC inverse agonists or partial agonists. Such agonists are useful as 

CC therapeutic agents for diseases or disorders associated with GPCRs. The 

CC present sequence is a non-endogenous version of a known human GPCR 

XX 

SQ Sequence 425 AA; 

Query Match 99.5%; Score 2209; DB 4; Length 425; 

Best Local Similarity 99.3%; Pred. No. 1.2e-220; 

Matches 422; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 
Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I M I M I I I I i i I i I M I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 



Qy 



61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I 



Db 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I 

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPEIANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I M I I I I I I I i I I i I I I I I I M I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I Ml 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTKKML 300 

Qy 301 MWLLVFALCYLPISVL^^VXKRVFGMFRQASDREAVYACFTFSHWLVYANSA7^NPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1.1 J i I I I I i I I I M I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKTUVFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I M I I I I I I I I I I I : I I I I I I I I M I I 

Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 

Qy 421 TTVLP 425 

I I I I I 

Db 421 TTVLP 425 



RESULT 15 
AAE04740 

ID AAE04740 standard; protein; 425 AA. 

XX 

AC AAE04740; 
XX 

DT lO-SEP-2001 (first entry) 
XX 

DE Cynomolgous Monkey Orexin 1 Receptor. 
XX 

KW Cynomolgous monkey; Orexin 1 Receptor; 7 Transmembrane Receptor family; 

KW 7TM; gene therapy; vaccine; microbial infection; HIV-1; HIV-2; pain; 

KW cancer; diabetes; obesity; anorexia; bulimia; urinary retention; 

KW Parkinson's disease; acute heart failure; hypotension; hypertension; 

KW osteoporosis; angina pectoris; myocardial infarction; stroke; ulcer; 

KW asthma; allergy; benign prostatic hypertrophy; migraine; vomiting; 

KW psychotic disorder; neurological disorder; anxiety; schizophrenia; 

KW manic depression; depression; delirium; dementia; mental retardation; 

KW dyskinesia; Huntington's disease; Gilles de la Tourette's syndrome. 

XX 

OS Macaca f ascicularis . 
XX 

PN WO200140259-'A2. 
XX 

PD 07-JUN-2001. 
XX 

PF 04~DEC-2000; 2000WO-US032849 . 
XX 

PR 02-DEC-1999; 99US-0168553P . 

PR 28-NOV-2000; 2000US-00723781 . 



PA (SMIK ) SMITHKLINE BEECHAM CORP. 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 

XX 

PI Ellis CE; 
XX 

DR WPI; 2001-408276/43. 

DR N-PSDB; AAD09335. 
XX . 

PT Novel Cynomolgous Monkey Orexin 1 Receptor polypeptides, for treating 

PT infections, pain, cancer, diabetes, obesity, asthma, schizophrenia, 

PT hypertension, urinary retention, Parkinson's disease and stroke. 
XX 

PS Claim 2; Page 28; 33pp; English. 

XX 

CC The present sequence is Cynomolgous Monkey Orexin 1 Receptor which is 

CC structurally related to members of 7 Transmembrane Receptor (7TM) family. 

CC The Orexin 1 Receptor polypeptide and polynucleotide are useful for 

CC treating bacterial, fungal, protozoan and viral infections, particularly 

CC infections caused by HIV-1 or HIV-2, pain, cancer, diabetes, obesity, 

CC anorexia, bulimia, Parkinson's disease, acute heart failure, hypotension, 

CC hypertension, urinary retention, osteoporosis, angina pectoris, 

CC myocardial infarction, stroke, ulcers, asthma, allergies, benign 

CC prostatic hypertrophy, migraine, vomiting, psychotic and neurological 

CC disorders including anxiety, schizophrenia, manic depression, depression, 

CC delirium, dementia and severe mental retardation, and dyskinesias, such 

CC as Huntington's disease or Gilles de la Tourette's syndrome. The 

CC polypeptide is also useful for structure-based design of its agonist, 

CC antagonist or inhibitor. The polynucleotide is useful for chromosome 

CC localisation studies and in gene therapy. The Orexin 1 Receptor 

CC polypeptide and polynucleotide are also useful as vaccines 

XX 

SQ Sequence 425 AA; 



Query Match 98.4%; Score 2183; DB 4; Length 425; 

Best Local Similarity 98.1%; Pred. No. 6.1e-218; 

Matches 417; Conservative 4; Mismatches 4; Indels 0; Gaps 0; 
Qy . 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n I I I I I I I I I I I I I I I I I i I : I I 

Db 1 MEPSATPGAQMRVPTGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFLVA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPVSLLVDITESWLFGHALCK 120 



Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I i I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAVMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I 

Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMR/^RKTAKML 300 

I I I I I I I I I I i I I I I I I I I I I I I I I I I I M I I I I : I I I I I I I I I M I I I M I I I I I I I M 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGQPQPRARAFLAEVKQMRARRKTAKML 300 



Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I M I M I I M I I I I i I I I I III II 1.1 I I I I I I I I M I I I I I I I II I I I I I I I I I II I 

Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAWACFTFSHWLWANSAA^ 360 

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420 

II II I I I I I II I I I I II I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I 

Db 361 LSGKFREQFKAAFSCCLPGPGPCGSLKAPSPRSSASHKSLSLQSRCSVSKLSEHWLTSV 420 

Qy 421 TTVLP 425 

Mill 

Db 421 TTVLP 425 
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Uo U X/Z oOo O 


Sequence 


5^ Appli 


34 


4^4 

1 vJ *t 


X ^ * 


u 


"^70 
•J / u 


o 


TTc:— Oft — 77f^— Q71 —91 
UO uo //D zj 1 X ZX 


Sequence 


zX, /\pp± 




4^4 


1 Q 

X-7 • 




■^7 n 


o 


TTQ— Oft — 7 7tf;_Q7 1 —1 H/I 
Uo UO / / O i7 /X XU41 


Sequence 


1 U 4 ^ App 


«_> u 


4 "^4 


1 Q 
X Z7 . 




^^70 

O / W 




TT<?— HQ — 7 QQ— Qf^ ^— ^ 


Sequence 


5, Appli 


37 


4^4 


1 Q 

X -7 > 


(: 
\j 


■J / u 


*1 


TTC!_nQ — 4<^1 — 4^ ^;n— 9 
Uo Ui? flDX fio Do Z D 


Sequence 


ZD, App± 




4 '^4 


1 Q 

X ^ . 


a. 
o 


^7n 


4 


TTc:_nQ— e;^^;— 9Q n— 9 1 

UO Ui? O/O Zi7U ZX 


Sequence 


zX/ AppX 


39 


4*^4 


1 Q 

X 7 . 






4 


uo Ui7 O /D Z"U XUfl 


Sequence 


X U 4 / App 


40 


434 


19. 


5 


381 


1 


US-nR-1 Q9-9ft ft-9 




z , /\ppxx 


41 


434 


19. 


6 


381 


2 


US-08-687-355A-2 


Sequence 


2, Appli 


42 


434 


19. 


6 


381 


3 


US-09-200-673-16 


Sequence 


16, Appl 


43 


434 


19. 


6 


381 


4 


US-09-407~367-2 


Sequence 


2, Appli 


44 


434 


19. 


6 


381 


4 


US-09-708-392-11 


Sequence 


11, Appl 


45 


432.5 


19. 


5 


370 


3 


US-09-172-353-7 


Sequence 


1, i^pli 



ALIGNMENTS 



RESULT 1 
US~09-479-128~2 

; Sequence 2, Application US/09479128 

; Patent No. 6319710 

; GENERAL INFORMATION: 

; APPLICANT: Berglind Ran Olafsdottir 
; APPLICANT: Jeffrey Gulcher 

; TITLE OF INVENTION: HUMAN NARCOLEPSY GENE 

; FILE REFERENCE: 2345.1005-001 

; CURRENT APPLICATION NUMBER: US/09/479,128 

; CURRENT FILING DATE: 2000-01-07 

; PRIOR APPLICATION NUMBER: US 09/379,083 

; PRIOR FILING DATE: 1999-08-23 

; NUMBER OF SEQ ID NOS : 22 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 2 
; LENGTH: 425 
; TYPE: PRT 



; ORGANISM: Homo Sapiens 
US-09-479-128-2 



Query Match 100.0%; Score 2218; DB 3; Length 425; 

Best Local Similarity 99.8%; Pred. No. 9.9e-193; 

Matches 424; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVEWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARR?\RGSILGIWAVSLAIMVPQA 180 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 r I I I 

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I M I I I I I I I M I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I i I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 

Qy 421 TTVLP 425 

I I I I I 

Db 421 TTVLP 425 



RESULT 2 

US-08-846-704-2 

; Sequence 2, Application US/08846704 
; Patent No. 6020157 
; GENERAL INFORMATION: 

APPLICANT: BERGSMA, DERK J. 

APPLICANT: ELLIS, CATHERINE E. 

TITLE OF INVENTION: NOVEL G-PROTEIN COUPLED 

NUMBER OF SEQUENCES: 4 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: RATNER & PRESTIA 

; STREET: P.O. BOX ,980 

CITY: VALLEY FORGE 
STATE: PA 
COUNTRY: USA 
; ZIP: 19482 

COMPUTER READABLE FORM: 



MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/846, 704 
FILING DATE: 30-APR-1997 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
; NAME: PRESTIA, PAUL F 

REGISTRATION NUMBER: 23,031 
REFERENCE/ DOCKET NUMBER: GH-70002 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 610-407-0700 

; TELEFAX: 610-407-0701 

; TELEX: 846169 

; INFORMATION FOR SEQ ID NO: 2: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 425 amino acids 

; TYPE: amino acid 

STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-846-704-2 

Query Match 99.8%; Score 2214; DB 3; Length 425; 

Best Local Similarity 99.5%; Pred. No. 2.3e-192; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I M I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK' 120 




Db 



61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



Qy 



121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 




Db 



121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 



Qy 



181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 




Db 



181 AVMECSSVLPELTJ^RTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 



Qy 



241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 





Db 



241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 



Qy 



301 MWLLVFTVLCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 




Db 



301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 



Qy 



361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 

Qy 421 TTVLP 425 

I I I I I 

Db 421 TTVLP 425 



RESULT 3 

US~09-211-823C-22 

Sequence 22, Application US/09211823C 
Patent No. 6664229 
GENERAL INFORMATION: 
APPLICANT: HAGEN, JAMES JOSEPH 
APPLICANT: TERRETT, JONATHAN ALEXANDER 
APPLICANT: UPTON, NEIL 
APPLICANT: PIPER, DAVID 
APPLICANT: SMITH, MARTIN IAN 
APPLICANT: KENNETT, GUY ANTHONY 
APPLICANT: PATEL, SARASWATI R. 

TITLE OF INVENTION: METHODS OF TREATMENT USING NOVEL LIGANDS 
TITLE OF INVENTION: OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR 
TITLE OF INVENTION: ANTAGONISTS THEREOF 
FILE REFERENCE: P50745 

CURRENT APPLICATION NUMBER: US/09/211, 823C 
CURRENT FILING DATE: 1998-12-15 
PRIOR APPLICATION NUMBER: US 60/069,459 
PRIOR FILING DATE: 1997-12-15 
PRIOR APPLICATION NUMBER: US 60/069,785 
PRIOR FILING DATE: 1997-12-16 
NUMBER OF SEQ ID NOS : 23 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 22 * 
LENGTH: 425 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-211-823C-22 

Query Match 99.8%; Score 2214; DB 4; Length 425; 

Best Local Similarity 99.5%; Pred. No. 2.3e-192; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 



Qy 
Db 



181 
181 



AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 



240 
240 



Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I .1 I I I U I I I l.-l I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 



Db 



301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I II I I I I I I I I I II I M I I I II I I I I II II II I I I II I I II II II M I M II 
301 MWLLVFALCYLPISVLimjKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 



Qy 



361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 




Db 



361 LSGKFREQFKT^AFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 



Qy 



421 TTVLP 425 



Db 



421 TTVLP 425 



RESULT 4 
US-08-846-704-4 

; Sequence 4, Application US/08846704 
; Patent No. 6020157 
; GENERAL INFORMATION: 
; APPLICANT: BERGSMA, DERK J. 
; APPLICANT: ELLIS, CATHERINE E. 
; TITLE OF INVENTION: NOVEL G-PROTEIN COUPLED 
NUMBER OF SEQUENCES: 4 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: RATNER & PRESTIA 

; STREET: P.O. BOX 980 

CITY: VALLEY FORGE 

STATE: PA 

COUNTRY: USA 
; ZIP: 19482 

; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/846,704 

FILING DATE: 30-APR-1997 
; CLASSIFICATION: 435 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 
; FILING DATE: 

ATTORNEY/AGENT INFORMATION: 

NAME: PRESTIA, PAUL F 

REGISTRATION NUMBER: 23,031 

REFERENCE/ DOCKET NUMBER: GH-70002 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-407-0700 

TELEFAX: 610-407-0701 
; TELEX: 846169 

; INFORMATION FOR SEQ ID NO: .4: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 4 02 amino acids 



; TYPE: amino acid 

; STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-846-704-4 

Query Match 95.0%; Score 2108; DB 3; Length 402; 

Best Local Similarity 100.0%; Pred. No. 8.5e-183; 

Matches 402; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLM7\MAYFQIFR 240 

I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MNA^LVFALCYLPISVLN^KRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 



RESULT 5 

US-08-462-509B-2 

; Sequence 2, Application US/08462509B 

; Patent No. 6410701 

; GENERAL INFORMATION: 

; APPLICANT: Soppet, Daniel et al 

; TITLE OF INVENTION: Human Neuropeptide Receptor 

NUMBER OF SEQUENCES: 12 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Human Genome Sciences, Inc. 

; STREET: 9410 Key West Avenue 

CITY: Rockiville 

STATE: MD 

COUNTRY ; USA 

ZIP: 20850 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 



; OPERATING SYSTEM: PC -DOS /MS -DOS 

; SOFTWARE: Patentin Release #1.0, Version #1.30 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/08/462, 509B 

FILING DATE: 05-JUN-1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: WO PCT/US95/05616 

FILING DATE: 05-MAY-1995 
; " ATTORNEY/AGENT INFORMATION: 

NAME: Wales, Michele M. 

REGISTRATION NUMBER: 43,975 

REFERENCE/ DOCKET NUMBER: PF168P1 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 
; TELEFAX: 301-309-8439 

; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 402 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 
US-08-462-509B-2 



Query Match 94.8%; Score 2104; DB 4; Length 402; 

Best Local Similarity 99.8%; Pred. No. 2e-182; 

Matches 401; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I i I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I i I I M I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VTPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELTs^RTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPR7VRAFIAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 



Qy 

Db 



361 
361 



LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 
i I I I I i I I I M I I M I M I I I I I I I I I I I I I I I I I I I I I I I I 
LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 



RESULT 6 

PCT-US95-05616-2 

; Sequence 2, Application PC/TUS9505616 
; GENERAL INFORMATION: 

APPLICANT; LI, ET AL. 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
; NUMBER OF SEQUENCES: 12 

CORRESPONDENCE ADDRESS: 
; ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

; ADDRESSEE: CECCHI, STEWART & OLSTEIN 

; STREET: 6 BECKER FARM ROAD 

CITY: ROSELAND 

STATE: NEW JERSEY 

COUNTRY: USA 

ZIP: 07068 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: 3.5 INCH DISKETTE 

; COMPUTER: IBM PS/2 

; OPERATING SYSTEM: MS-DOS 

S0FTW7VRE: WORD PERFECT 5.1 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: PCT/US95/05616 

; FILING DATE: concurrently 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: FERRARO, GREGORY D. 

REGISTRATION NUMBER: 36,134 

REFERENCE/DOCKET NUMBER: 325800-268 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 201-994-1700 

TELEFAX: 201-994-1744 \ 
; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 
; . LENGTH: 402 AMINO ACIDS 

TYPE: AMINO ACID 

STRANDEDNESS: 

TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
PCT-US95-05616-2 

Query Match 94.8%; Score 2104; DB 5; Length 402; 

Best Local Similarity 99.8%; Pred. No. 2e-182; 

Matches 401; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I i I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 



181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYIAPLGLMAMAYFQIF^ 240 

241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKT/y^L 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAWACFTFSHWLWANSA;^ 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I M I I I I I I I I I I I I I I I I I I I I I I M 

301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 
361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 



RESULT 7 
US-08-846-705-2 

; Sequence 2, Application US/08846705 

; Patent No. 5935814 

; GENERAL INFORMATION: 

; APPLICANT: BERGSMA, DERK J. 

; APPLICANT: ELLIS, CATHERINE E 

; TITLE OF INVENTION: NOVEL G- PROTEIN COUPLED 

; NUMBER OF SEQUENCES: 5 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: RATNER & PRESTIA 

; STREET: P.O. BOX 980 

CITY: VALLEY FORGE 
; STATE : PA 

COUNTRY: USA 

ZIP: 19482 
. COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/846, 705 

FILING DATE: 30-APR-1997 

CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: 

FILING DATE: 
; ATTORNEY/AGENT INFORMATION: 

NAME: PRESTIA, PAUL F 

REGISTRATION NUMBER: 23/031 
; REFERENCE/ DOCKET NUMBER: GH-70003 

; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 610-407-0700 

TELEFAX: 610-407-0701 

TELEX: 846169 
; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 389 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: single 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-08-846-705-2 



Query Match 85.7%; Score 1902.5; DB 2; Length 389; 

Best Local Similarity 96.1%; Pred. No. 3.3e-164; 

Matches 368; Conservative 2; Mismatches 10; Indels 3; Gaps 2; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARR7VRGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I i I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSC-CLPGLGP 382 

III : : III II I 
Db 361 LSG— -CKEKSLALSCPSCPGHDP 381 



RESULT 8 

US-09-211-823C~23 

Sequence 23, Application US/09211823C 
Patent No. 6664229 
GENERAL INFORMATION: 
APPLICANT: HAGEN, JAMES JOSEPH 
APPLICANT: TERRETT, JONATHAN ALEXANDER 
APPLICANT: UPTON, NEIL 
APPLICANT: PIPER, DAVID 
APPLICANT: SMITH, MARTIN IAN 
APPLICANT: KENNETT, GUY ANTHONY 
APPLICANT: PATEL, SARASWATI R. 



TITLE OF INVENTION 
TITLE OF INVENTION 
TITLE OF INVENTION 



METHODS OF TREATMENT USING NOVEL LIGANDS 
OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR 
ANTAGONISTS THEREOF 
FILE REFERENCE: P50745 

CURRENT APPLICATION NUMBER: US/09/211, 823C 
CURRENT FILING DATE: 1998-12-15 
PRIOR APPLICATION NUMBER: US 60/069,459 
PRIOR FILING DATE: 1997-12-15 



; PRIOR APPLICATION NUMBER: US 60/069,785 
; PRIOR FILING DATE: 1997-12-16 
; NUMBER OF SEQ ID NOS : 23 

; SOFTWARE: FastSEQ for Windows Version 3,0 
; SEQ ID NO 23 

LENGTH: 389 

TYPE: PRT 

ORGANISM: HOMO ST^IENS 
US-09-211-823C-23 



Query Match 85.7%; Score 1902.5; DB 4; Length 389; 

Best Local Similarity 96.1%; Pred. No. 3.3e-164; 

Matches 368; Conservative 2; Mismatches 10; Indels 3; Gaps 2; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 



Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 



Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 



Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSC-CLPGLGP 382 

III :: III II I 
Db 361 LSG— CKEKSLALSCPSCPGHDP 381 



RESULT 9 

PCT-US95-05616-6 

; Sequence 6, Application PC/TUS9505616 
; GENER7U. INFORMATION: 

APPLICANT: LI, ET AL, 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: ' CARELLA, BYRNE, BAIN, GILFILLAN, 
; ADDRESSEE: CECCHI, STEWART & OLSTEIN 

; STREET: 6 BECKER FARM ROAD 

CITY: ROSELAND 
STATE: NEW JERSEY 
COUNTRY: USA 



ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 
COMPUTER: IBM PS/2 
OPERATING SYSTEM: MS-DOS 
SOFTWARE: WORD PERFECT 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US95/05616 
FILING DATE: concurrently 
CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 
NAME: FERRARO, GREGORY D. 
REGISTRATION NUMBER: 36,134 
REFERENCE/ DOCKET NUMBER: 325800-268 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 201-994-1700 
TELEFAX: 201-994-1744 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 377 BASE PAIRS 
TYPE: AMINO ACID 
STRANDEDNESS : SINGLE 
TOPOLOGY: LINEAR 
MOLECULE TYPE: cDNA 
PCT-US95-05616-6 

Query Match 85.7%; Score 1901; DB 5; Length 377; 

Best Local Similarity 96.6%; Pred. No. 4.4e-164; 

Matches 366; Conservative 2; Mismatches 9; Indels 2; Gaps 1; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLI75JVYVAVFWA 60 

I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIT^YVAVFWA 60 

Qy . 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSCCLPG 379 

I I I :: I I I I 
Db 361 LSGCKEKSLVLSPSC~PG 377 



RESULT 10 
US-08-462-509B-4 

; Sequence 4, Application US/08462509B 

; Patent No. 6410701 

; GENERAL INFORMATION: 

; APPLICANT: Soppet, Daniel et al 

; TITLE OF INA/ENTION: Human Neuropeptide Receptor 
; NUMBER OF SEQUENCES: 12 

CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Human Genome Sciences, Inc. 

; STREET: 9410 Key West Avenue 

; CITY: Rockiville 

; STATE: MD 

; COUNTRY: USA 

ZIP: 20850 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentin Release #1.0, Version #1.30 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/462, 509B 

FILING DATE: 05-JUN-1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/US95/05616 
; FILING DATE: 05-MAY~1995 

; ATTORNEY/AGENT INFORMATION: 
; NAME: Wales, Michele M. 

; REGISTRATION NUMBER: 43,975 

; REFERENCE/ DOCKET NUMBER: PF168P1 

TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 

TELEFAX: 301-309-8439 
; INFORMATION FOR SEQ ID NO: 4: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 369 amino acids. 

; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-462-509B-4 

Query Match 85.5%; Score 1897; DB 4; Length 369; 

Best Local Similarity 99.7%; Pred. No. 9.8e-164; 

Matches 362; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLyTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I M I I I I I I I I I I I I I I I I M I M M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db , 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 



Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMR7\RRKTAKML 300 

I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I i I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVLimiKRVFGMFRQASDREAWACFTFSHWLWANSAANPIIW 360 

I I I I M I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IWVLLVFALCYLPISVLNVXKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 11 
PCT-US95-05616-4 

; Sequence 4, Application PC/TUS9505616 

; GENERAL INFORMATION: 

; APPLICANT: LI, ET AL. * 

; TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 
; ADDRESSEE: CECCHI, STEWART & OLSTEIN 

STREET: 6 BECKER FARM ROAD 

CITY: ROSELAND 

STATE: NEW JERSEY 

COUNTRY: USA 

ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 

COMPUTER: IBM PS/2 
; OPERATING SYSTEM: MS-DOS 

; SOFTWARE: WORD PERFECT 5.1 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US95/05616 
; FILING DATE: concurrently 

CLASSIFICATION: 
ATTORNEY/ AGENT INFORMATION: 

NAME: FERRARO, GREGORY D. 

REGISTRATION NUMBER: 36,134 

REFERENCE/ DOCKET NUMBER: 325800-268 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 201-994-1700 

TELEFAX: 201-994-1744 
; INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 369 BASE PAIRS 

TYPE: AMINO ACID 

STRANDEDNESS : SINGLE 

TOPOLOGY: LINEAR 



MOLECULE TYPE: cDNA 
PCT-US95~05616-4 



Query Match 85.5%; Score 1897; DB 5; Length 369; 

Best Local Similarity 99.7%; Pred. No. 9.8e-164; 

Matches 362; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I i I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCIAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy ' 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 12 
US-08-462-509B-6 

Sequence 6, Application US/08462509B 
Patent No. 6410701 
GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al 

TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 
STREET: 9410 Key West Avenue 
CITY: Rockiville 
STATE: MD 
COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 



; APPLICATION NUMBER: US/08/462, 509B 

FILING DATE: 05-JUN-1995 
; CLASSIFICATION: 435 

; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: WO PCT/US95/05616 

FILING DATE: 05-MAY-1995 
ATTORNEY/AGENT INFORMATION: 
; NAME: Wales, Michele M. 

; REGISTRATION NUMBER: 43,975 

REFERENCE/DOCKET NUMBER: PF168P1 
; TELECOMMUNICATION INFORMATION: 

; TELEPHONE: 301-309-8504 

; TELEFAX: 301-309-8439 

; INFORMATION FOR SEQ ID NO: 6: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 372 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-462-509B-6 

Query Match 85.4%; Score 1894; DB 4; Length 372; 

Best Local Similarity 99.4%; Pred. No. 1.8e-163; 

Matches 361; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I 
Db 1 MEPSATPGAQMGVPPGSRDPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I M I I II M I M M I I I M I I I I I I M I I I I I M I I M I I I I I II I I M I I I I II II 

Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I II I I II II I I II I I II I II I II I II II II I II I I I I II I I II I I I I I I I I II I I II II 
Db 121 VTPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARR7VRGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 24 0 

I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I M I II I I I I I II I I I I I I I I I I II 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I II I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANS7\ANPIIYNF 360 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 13 
US-09-426-290-2 

; Sequence 2, Application US/09426290 



; Patent No. 6410712 
; GENERAL INFORMATION: 

; APPLICANT: Berglind Ran Olafsdottir 
; APPLICANT: Jeffrey Gulcher 

; TITLE OF INVENTION: HUMAN NARCOLEPSY GENE 

; FILE REFERENCE: 2345.2001-000 

; CURRENT APPLICATION NUMBER: US/09/426,290 

; CURRENT FILING DATE: 1999-10-25 

; NUMBER OF SEQ ID NOS : 24 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 2 

LENGTH: 444 

TYPE: PRT 
; ORGANISM: Homo Sapiens 
US-09-426-290-2 



Query Match 65.8%; Score 1460; DB 4; Length 444; 

Best Local Similarity 68.8%; Pred. No. 4.,6e~124; 

Matches 284; Conservative 50; Mismatches 69; Indels 10; Gaps 5; 

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 

: : I I I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
Db . 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 

I I I I I I I I I I I I M I M I I I I I I I I : I : I I I I I : I II : I II I I I I I I II I II : II 
Db 84 HMRTVTNYFIVNLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

III I I I I I II I I I I I I : I I I I I : I II II: II II I I : I I I I I i I I : I I Mi: 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIVIIWIVSCIIMIPQAIVMECSTVFPGLANK 203 

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I 11:111111 : : I I I : I I I I I : I II : I I I 11:111111111 I I II I I : I : 
Db 204 TTLFTVCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWK— RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLP 313 

III : I I : I : I I I I : II : I II I I II : II I II I II I I : I I I I 

Db 264 RKWKPLQPVSQ PRGPGQPTKSRMSAV7\AEIKQIRARRKTARMLMWLLVFAICYLP 319 

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373 

11:11111111111 III III I I I I I I I I I I I I I I I I I II I I I I I I I I I : I I I I I 

Db 320 ISILNVLKRVFGMFAHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379 

Qy 374 SCCLPGLGPCGSLKAPSPRSSA-SHKSLSLQ— SRCSVSKISEHWLTSVTTV 423 

III I : : I : I I I I I : I : : : I I : I 1. I I I I I : : I : 

Db 380 SCCCLGVHHRQEDRLTRGRTSTESRKSLTTQISNFDNISKLSEQWLTSISTL 432 



RESULT 14 
US-09-119-788-2 

Sequence 2, Application US/09119788 

Patent No. 6166193 
GENERAL INFORMATION: 

APPLICANT: Yanagisawa^ Masashi 

TITLE OF INVENTION: CDNA CLONE MYl THAT ENCODES 
TITLE OF INVENTION: A NOVEL HUMAN 7 -TRANSMEMBRANE RECEPTOR 



NUMBER OF SEQUENCES: 2 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: SmithKline Beecham Corporation 

; STREET: 709 Swedeland Road 

; CITY: King of Prussia 

STATE: PA 

; COUNTRY: United States of America 

ZIP: 19406 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette 

; COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ for Windows Version 2.0 

; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/09/119,788 

FILING DATE: 21-JUL-1998 
; CLASSIFICATION: 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: 60/053,790 

; FILING DATE: 25-JUL-1997 

ATTORNEY/AGENT INFORMATION: 
; NAME: King, William T 

; REGISTRATION NUMBER: 30,954 

REFERENCE/DOCKET NUMBER: GH50029 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 610-270-5515 

TELEFAX: 610-270-5090 

TELEX: 

; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 444 amino acids 

; TYPE: amino acid 

STRANDEDNESS : single 
; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-09-119-788-2 

Query Match 65.8%; Score 1459; DB 3; Length 444; 

Best Local Similarity 68.8%; Pred. No. 5.6e-124; 

Matches 284; Conservative 50; Mismatches 69; Indels 10; Gaps 5; 



Qy 



17 



SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNH 75 



Db 



24 




Qy 



76 



HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 



Db 



84 




Qy 



Db 



136 



144 



TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

III II I I II I 11 I I II : II I II : I I I II: II M M M I I I I I I I : I I III: 
TLSCI ALDRWYAI CHPLMFKSTAKRARNS I VI IWI VSCI IMI PQAI VMECSTVFPGLANK 203 



Qy 



196 



TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 



Db 



204 




Qy 256 I^WK--RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLP 313 

III : I I : I : I I M : M : I I I I I I I : I I I I I I I I I I : I I M 

Db 264 RKWKPLQPVSQ PRGPGQPTKSRMGAVAAEIKQIRARRKTARMLMWLLVFAICYLP 319 

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373 

II : II II I I II I I I III III I I II I I I I II I I II I I M I M I I I I I I I : I I I I I 

Db 320 ISILNVLKRVFGMFAHTEDRETVYAWFTFSHWLVYANSA/JJ^PIIYNFLSGKFREEFKAAF 379 

Qy 374 SCCLPGLGPCGSLKAPSPRSSA-SHKSLSLQ— SRCSVSKISEHWLTSVTTV 423 

III I: : |:| i III: I : ::ll:|l lllll::|: 

Db 380 SCCCLGVHHRQEDRLTRGRTSTESRKSLTTQISNFDNISKLSEQWLTSISTL 432 



RESULT 15 
US-08-513-974B~54 

Sequence 54, Application US/08513974B 
Patent No. 6114139 
GENERAL INFORMATION: 

APPLICANT: Hinuma, Shuji 
APPLICANT: Hosoya, Masaki 
APPLICANT: Fujii, Ryo 
APPLICANT: Ohtaki, Tetsuya 
APPLICANT: Fukusumi, Shoji 
APPLICANT: Ohgi, Kazuhiro 

TITLE OF INVENTION: G PROTEIN COUPLED RECEPTOR PROTEIN, 
TITLE OF INVENTION: PRODUCTION, AND USE THEREOF 
NUMBER OF SEQUENCES: 380 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: DIKE, BRONSTEIN, ROBERTS & CUSHMAN, LLP 
STREET: 130 Water Street 
CITY: Boston 
STATE: MA 
COUNTRY: USA 
ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/513, 974B 
FILING DATE: 14-SEP-1995 
CLASSIFICATION: 536 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/ JP95/01599 
FILING DATE: lO-AUG-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-093989 
FILING DATE: 19-AUG-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-057186 
FILING DATE: 16-MAR-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-007177 
FILING DATE: 20-JAN~1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-326611 



FILING DATE: 28-DEC-1994 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-270017 
; FILING DATE: 02-NOV-1994 

; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-236357 

FILING DATE: 30-SEP-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-236356 

; FILING DATE: 30-SEP-1994 

; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189274 
; FILING DATE: ll-AUG-1994 

; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: JP 6-189273 

FILING DATE: ll-AUG-1945 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189272 

FILING DATE: ll-AUG-1994 
ATTORNEY /AGENT INFORMATION: 
; NAME: Resnick, David S. 

; REGISTRATION NUMBER: 34,235 

REFERENCE/ DOCKET NUMBER: 45753 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617-523-3400 

TELEFAX: 617-523-6440 
; INFORMATION FOR SEQ ID NO: 54: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 263 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: 

TOPOLOGY: linear 
; MOLECULE TYPE: peptide 
US-08-513-974B-54 

Query Match 59.1%; Score 1311; DB 3; Length 263; 

Best Local Similarity 96.6%; Pred. No. 7.5e-lll; 

Matches 254; Conservative 2; Mismatches 7; Indels 0; Gaps 0; 



Qy 91 ADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICH 150 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ADVLWAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVVVLTLSSIALDRWYAICH 60 

Qy 151 PLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERWADDL 210 

I I I I I M I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 PLLFKSTARRARGSILGIWAVSLAVMVPQAAVMECSSVLPELANRTRLLSVCDERWADDL 120 

Qy 211 YPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQ 270 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I 

Db 121 YPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLDDQGQ 180 

Qy 271 GLSGEPQPRARAFL7VEVKQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQA 330 

III II I I II II I I II II II II II I II II II II II I II I I II I II I II II II II I I II il 
Db 181 GLSSEPQPRARAFLAEVKQMR75lRRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQA 240 

Qy 331 SDREAVYACFTFSHWLVYANSAA 353 

II I II : I I I I I I I I I I I I I I I I I 



Db 



241 SDREAIYACFTFSHWLVYANSAA 263 



Search completed: October 14, 2004, 10:58 
Job time : 26.8791 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



October 14, 2004, 10:42:40 ; Search time 22.5986 Seconds 

(without alignments) 
1809.496 Million cell updates/sec 

US-10-070-532-2 

2219 

1 MEPSATPGAQMGVPPGSREP CSVSKISEHWLTSVTTVLP 425 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 283416 seqs, 96216763 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283416 



Database 



PIR_79:* 
pirl : * 
pir2:* 
pir3:* 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 
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Query 
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ID 
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beta-l-adrenergic 
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bombesin receptor. 


A n 

U 


"^7 ft 
o / o 


1 "7 
1 / . 


n 
u 


oy 4 


o 

Z 


JO /z uy 


galanin receptor — 


41 


372.5 


16. 


8 


387 


2 
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galanin receptor 2 


42 


372.5 


16. 


8 
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2 
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43 
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6 
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2 
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neuropeptide Y rec 


44 


368.5 
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2 
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bombesin/ GRP rece 
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16. 


6 
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2 


A46632 


bombesin-like pept 



ALIGNMENTS 



RESULT 1 

350150 

gastric CCK-A receptor - rabbit 

C; Species: Oryctolagus cuniculus (domestic rabbit) 

C;Date: 14-Jul-1995 #sequence__revision 21-Jul-1995 #text_change 20-Apr-2000 
C;Accession: S50150 

R; Reuben, M. ; Rising, L.; Prinz, C; Hersey, S.; Sachs, G. 
Biochim. Biophys. Acta 1219, 321-327, 1994 

A; Title: Cloning and expression of the rabbit gastric CCK-A receptor. 

A; Reference number: S50150; MUID: 95002144 ; PMID:7918628 

A;Accession: S50150 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-427 <REU> 

C'Superfamily: neurokinin 1 receptor 



Query Match 20,9%; Score 464.5; DB 2; 

Best Local Similarity 29.6%; Pred. No. 2.1e-31; 
Matches 128; Conservative 85; Mismatches 168; 



Length 427; 

Indels 51; Gaps 13; 



Qy 8 GAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEW VLIAAYVAVFWALVGN 64 

I I : I I I : : I I I : I II II I : I : : : : : II 

Db 9 GNASGIPP PCELGLDNETLFCLDQP PPSKEWQPAVQILLYSLIFLLSVLGN 59 

Qy 65 TLVCLAWRNHHMRTWNYFIWLSLMVLVTAICLPASLLVDITESWLFG^ 124 

III : II llllll I:::|:::|::: I : I : I :::::: II 1 II I I 
Db 60 TLVITVLIRNKRMRTVTNIFLLSLAISDLMLCLFCMPFNLIPNLLKDFIFGSALCKTTTY 119 

Qy 125 LQAVSVSVAVLTLSFIALDRWYAICHPLLFK — STARRARGSILGIWAVSLAIMVPQAAV 182 

I 1111:11 I : I : I : I I I M : I I I I : I II I I 
Db 120 LMGTSVSVSTLNLVAISLERYGAICKPLQSRVWQTKSHALKVIAATWCLSFAIMTPYPIY 179 

Qy 183 MECSSVLPEIJi^RTRLFSVCDERWADDLYPKIYHSCFFIVTYIJVPLGLMAMAYF 242 

|:::| : ::| |: : :|: :: :| I :l :| I I :| 

Db 180 SNLVPFTKTNNQTANMCRFLLPSDVMQQAWHTFLLLILFLIPGIVMMVAYGMISLEL 236 

Qy 243 W-GRQIPGTTSALVRNWK ^ RPSDQLGDLEQGLSGEPQPRAJ^ 282 

: I : : : I : I I I : I : I II I I 

Db 237 YQGIKFDASQKKSAKERKASTGSGRFEDNDGCYLQRSKPTRQL-ELQQ-LSGGGGGRVSR 294 

Qy 283 F — lAEVKQMRARRKTAKMmVVLLVFALCYLPISVLNVLKRVFGM 340 

: : I : :: : II I I : : : : I II : : 1 I I : : : : : I 
Db 295 IHSSSSAAAmAKKRVIRMIJWIVVLFFLCWMPIFSANAWRAYDTV SAERRLSGTPI 351 

Qy 341 TFSHWLVYANSAT^PIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSL 400 

: I . I I : I I I I I I I ::: I I ' I I I I I I II I II : I : 

Db 352 SFILLLSYTSSCVNPIIYCFMNKRFRLGFMATFPCC-PNPGP PGPRAEAGEEEE 404 

Qy 401 SLQSRCSVSKIS 412 

: I I : I : I 
Db 4 05 GRTTRASLSRYS 416 



RESULT 2 
A41738 

neuropeptide Y receptor - fruit fly (Drosophila melanogaster) 
N; Alternate names: G protein-coupled receptor PR4 
C; Species: Drosophila melanogaster 

C;Date: 16-Sep-1992 #sequence_revision 16-Sep-1992 #text__change 09"Jul-2004 
C;Accession: A41738 

R;Li, X.J.; Wu, Y.N.; North, R.A. ; Forte, M. 
J. Biol. Chem. 267, 9-12, 1992 

A; Title: Cloning, functional expression, and developmental regulation of a 

neuropeptide Y receptor from Drosophila melanogaster. 

A; Reference number: A41738; MUID: 92112730; PMID: 1370455 

A;Accession: A41738 

A;Molecule type: mRNA 

A; Residues: 1--449 <LIA> 

A; Cross-references: UNIPROT : P25931; GB:M81490; NID:gl57996; PIDN:7Ws^8727 . 1; 

PID:gl57997 

C; Genetics: 

A; Gene: FlyBase : NepYr 

A;Cross-ref erences : FlyBase : FBgnO004842 
C; Super family: neurokinin 1 receptor 

C;Keywords: appetite; G protein-coupled receptor; transmembrane protein 



Query Match 



20.2%; Score 447.5; DB 2; Length 449; 



Best Local Similarity 29.4%; Pred. No. 5.9e-30; 

Matches 109; Conservative 63; Mismatches 138; Indels 61; Gaps 9; 



Qy 26 DYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNHHMRTVTNYFI 85 

II: I I : : : : I : : I : I I : I I II I : II I I M I I I 

Db ' 75 DYD LLSEDMWSSAYFKIIVYMLYIPI FIFALIGNGTVCYIVYSTPRMRTVTNYFI 129 

Qy 86 WLSIADVLVTAICLPASLL-VDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDR 144 

:|:: |:|:: I |:| : : I I II III : I Mill |: II |::ll 
Db 130 ASIAIGDimSFFCEPSSFISLFILNYWPFGLALCHFVNYSQAVSVLVSAYTLVAISIDR 189 

Qy 145 WYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDE 204 

: I I II : I I I I I : I :: I I : I I : : : : I I 

Db 190 YIAIMWPLKPRITKRYATFIIAGVWFIALATALPIPIVSGLDIPMSPWHTKCEKY-ICRE 248 

Qy 205 RWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQ 264 

I I I : :: I I I :: j : I : : I : : .1 I : I I 

Db 249 MWPSRSQEYYYTLSLFT^QFWPLGVLIFTYARITIRVWAKRPPGEA ETNRDQ 301 

Qy 265 LGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVF 324 

: I : : II || : : 1 : : II I : || : : I : I 
Db 302 RMARSKRKMVKMMLTWIVFTCCWLPFNILQLL 334 

Qy 325 GMFRQASDREAV YACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFS 374 

:| I I I I III :: Mill ::: :|l I 

Db 335 LNDEEFAHWDPLPYVWFAF-HWLAMSHCCYNPIIYCYMNARFRSGFVQLMHRMPG 388 

Qy 375 CCLPGLG 381 

II I : I 

Db 389 LRRWCCLRSVG 399 



RESULT 3 

139187 

neuropeptide Y/peptide YY receptor Y2 - human 

N; Alternate names: .neuropeptide y/peptide YY receptor type 2 

C; Species: Homo sapiens (man) 

C;Date: Ol-Mar-1996 #sequence_revision Ol-Mar-1996 #text_change 09-Jul-2004 
C;Accession: 139187; 139163; G02301 
R;Gerald^ C; Walker^ M.W. ; Vaysse, P.J. 
J. Biol. Chem. 270, 26758-26761, 1995 

A; Title: Expression cloning and pharmacological characterization of a human 

hippocampal neuropeptide Y/peptide YY Y2 receptor subtype. 

A;Reference number: 139187; MUID: 96070760; PMID:7592910 

A;Accession: 139187 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-381 <GER> 

A; Cross-references: UNIPROT: P49146; EMBL:U36269; NID: gl063633; PIDN:AAC50281 . 1; 
PID:gl063634 

R;Rose, P.M.; Fernandes^ P.; Lynch, J.S.; Frazier, S.T.; Fisher, S.M.; Kodukula, 

K.; Kienzle, B.; Seethala, R. 

J. Biol. Chem. 270, 22661-22664, 1995 

A; Title: Cloning and functional expression of a cDNA encoding a human type 2 
neuropeptide Y receptor. 

A;Reference number: 139163; MUID: 96032678; PMID:7559383 
A;Accession: 139163 



A; status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: mRNA 

A;Residues: 1-133, 'AMSS-SSl <ROS> 

A; Cross-references: EMBL:U32500; NID: gl000750; PIDN:AAA93170 . 1; PID:gl000751 
R;Yan, H.; Yang, J- ; Marasco, J.; Yamaguchi, K. ; Brenner, S.; Collins, F. ; 
Karbon, W. 

submitted to the EMBL Data Library, December 1995 
A; Reference number: H01019 
A;Accession: G02301 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 

A;Residues: 1-171, ' G' , 173, * R' , 175-201, ' P 203-208, 'A' , 210-381 <YAN> 

A; Cross-references: EMBL:U42389; NID: gl314329; PIDN: AAB07760 . 1; PID:gl314330 

C; Genetics: 

A; Gene: GDB:NPY2R 

A; Cross-references: GDB: 4365607; OMIM: 162642 

A;Map position: 4q31-4q31 

C; Superf amily : neurokinin 1 receptor 

C;Keywords: appetite; G protein-coupled receptor; glycoprotein; lipoprotein; 

thiolester bond; transmembrane protein 

F; 49-76/Domain: transmembrane #status predicted <TM1> 

F; 87~113/Domain: transmembrane #status predicted <TM2> 

F; 166-186/Domain: transmembrane #status predicted <TM4> 

F; 22 1-2 37 /Domain: transmembrane #status predicted <TM5> 

F; 269-2 91/Domain: transmembrane #status predicted <TM6> 

F; 305-328/Domain: transmembrane #status predicted <TM7> 

F; 123-203/Disulfide bonds: #status predicted 

F; 342/Binding site: palmitate (Cys) (covalent) #status predicted 
F; 372/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 19.6%; Score 434; DB 2; Length 381; 

Best Local Similarity 21,6%; Pred. No. 6.8e-29; 

Matches 112; Conservative 74; Mismatches 140; Indels 80; Gaps 10; 



Qy 


3 


PSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWALV 
III:: Mil: 1 : : | || I | :: :: :: 
PQTTPRGEL VPDPEPELI DSTKLIEVQWLILAYCSIILLGVT 


62 


Db 


24 


66 


Qy 


63 


GNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVI 

1 1 : II - 1 : 1 1 1 1 1 1 : 1 1 1 1 :: 1 1 : 1 1 : 1 1 1 : 1 : 1 1 1 1 :: 
GNSLVIHWTKFKSMRTVTNFFIANLAVADLLVNTLCLPFTLTYTLMGEWKMGPVLCHLV 


122 


Db 


67 


126 


Qy 


123 


P YLQAVS VSVAVLTLS FI ALDRWYAI CHPLLFKSTARRT^GS I LGI -WAVSLAIMVPQAA 

II 1 ::| I: :||: Mill 1 : 1 1 ::| MM 1 :| : 1 1 
P YAQGLAVQVST I TLTVI ALDRHRCI VYHLESK- 1 S KRI S FLU GLAWGI SALLAS PLAI 


181 


Db 


127 


185 


Qy 


182 


VMECS--SVLPELANRTRLFSVCDERWADD LYPKIYHSCFFIVTYLAPLGLMAMAYF 

II :: 1 : 1 1 : 1 : : 1 : 1 : : | : || | : : : : | 
FREYS LI EI I PD FE 1 VACTEKWPGEEKS I YGTVYS LS SLLI LYVLPLGI I S FS YT 


236 


Db 


186 


240 


Qy 


237 


QIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKT 
: 1 : II II 1 : 1 : II 
RIWSKLKNHVSPGA ANDHYHQRRQKT 


296 


Db 


241 


266 


Qy 


297 


AKMLMWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN S AANP I 
1 1 1 : 1 :: II 1 : : 1 1 1 : : : 1 1 : II 1 : : : II 1 : 
TKMLVCWWFAVSWLP LHAFQLAVDIDSQVLDLKEYKLIFTVFHII7\MCSTFANPL 


356 


Db 


267 


323 



Qy 357 lYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 

:| ::: :|: I :|| I I I I :|: 

Db 324 LYGWMNSNYRKAFLSAFRC EQRLDAIHSEVSV 355 



RESULT 4 
152315 

G protein-coupled receptor UHR-1 - rat 
C; Species: Rattus sp. (rat) 

C;Date: lO-Sep-1999 #sequence_revision lO-Sep-1999 #text_change 19-May-2000 
C;Accession: 152315 

R;Welch, S.K.; O'Hara, B.F.; Kilduff, T.S.; Heller, H.C. 
Biochem. Biophys . Res. Commun. 209, 606-613, 1995 

A;Title: Sequence and tissue distribution of a candidate G-coupled receptor 
cloned from rat hypothalamus. 

A; Reference number: 152315; MUID: 95251659; PMID: 7733930 
A;Accession: 152315 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 

A;Residues: 1-370 <RES> 

A; Cross-references: GB:S77867; NID:g998527; PIDN: AAB34129 . 1 ; PID:g998528 
C; Super family : neurokinin 1 receptor 

Query Match 19.5%; Score 432.5; DB 1; Length 370; 

Best Local Similarity 32.2%; Pred. No. 8.9e-29; 

Matches 106; Conservative 66; , Mismatches 116; Indels 41; Gaps 7; 

Qy 44 QYEWVLIAAWAVFWALVGNTLVCLAVWRNHHMRTWNYFIWLSIADVLVTAICLPAS 103 

I : :: : I I I I I I I I I : I : I : I I I : I I I : I : I I I : I I : I : 
Db 58 QLKGLIVMLYSIVVWGLVGNCLLVLVIARVRRLHNVTNFLIGNLALSDVLMCAACVPLT 117 

Qy 104 LLVDI-TESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRAR 162 

I I : I I I I : : : I I I : I I : I I I : I I : I I : : I I I : : : : " 

Db 118 LAYAFEPRGWVFGGGLCHLVFFLQPVTVYVSVFTLTTIAVDRYWLVHPLRRRISLKLSA 177 

Qy 163 GSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERW-ADDLYPKIYHSCFFI 221 

: : I I I I I : I : : I I : I : M I : I I : : : I I : 
Db 178 YAVLGIWALSAVLALPAAVHTYHVELKP HDVRL CEEFWGSQERQRQIYAWGLLL 231 

Qy 222 VTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRAR 281 

I I I I I : : : I : : I I I : I I : : : I III 

Db 232 GTYLLPLLAILLSYVRVSVKLRNRWPGSVTQSQADW DR7VR 272 

Qy 282 AFLAEVKQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFT 341 

l|:| :|:||::lllll:||: : I:|: : :| I I 
Db 273 RRRTFCLLWWWFALCWLPLHIFNLLR DLDPRAIDPYAFGLVQL 318 

Qy 342 FSHWLVYANSAANPIIYNFLSGKFREQFK 370 

I I I :: : I I II : I III:: 
Db 319 LCHWLAMSSACYNPFIYAWLHDSFREELR 347 



RESULT 5 
JN0692 

cholecystokinin type A receptor - human 
C;.Species: Homo sapiens (man) 



C;Date: 03-Feb-1994 #sequence_revision 03-Feb-1994 #text_change 09-Jul-2004 

C;Accession: JN0692; JN0590 

R;de Weerth, A.; Pisegna, J.R.; Huppi, K.; Wank, S.A. 
Biochem. Biophys . Res. Commun. 194, 811-818, 1993 

A; Title: Molecular cloning, functional expression and chromosomal localization 

of the human cholecystokinin type A receptor. 

A; Reference number; JN0692; MUID: 93343941; PMID: 8343165 

A;Accession: JN0692 

A; Molecule type: mRNA 

A; Residues: 1-42 8 <DEW> 

A; Cross-references: UNIPROT : P32238 ; GB:L19315; NID:g306595; PIDN : AAA02819 . 1; 
PID:g306596 

A; Experimental source: gallbladder 

R;Ulrich, CD.; Ferber, I.; Holicky, E.; Hadac, E. ; Buell, G. ; Miller, L.J. 
Biochem. Biophys. Res. Commun. 193, 204-211, 1993 

A; Title: Molecular cloning and functional expression of the human gallbladder 
cholecystokinin A receptor. 

A; Reference number: JN0590; MOID: 93277552 ; PMID: 8503909 
A;Accession: JN0590 

A; Status: nucleic acid sequence not shown 

A; Molecule type: DNA 
A; Residues: 1-428 <ULR> 

A; Cross-references: GB:L13605; NID:g306490; PIDN:AAA35659. 1; PID:g306491 
A; Experimental source: gallbladder 

C; Comment: This protein has diverse physiological roles in the gastrointestinal 

system where it mediates pancreatic growth and enzyme secretion, smooth muscle 

contraction of the gallbladder and stomach, and secretion from gastric mucosal 

cells . 

C; Genetics : 

A; Gene : GDB : CCKAR 

A; Cross-references: GDB: 141927; OMIM: 118444 
A;Map position: 4pter-4qter 
C;Superfamily : neurokinin 1 receptor 

C; Keywords: G protein-coupled receptor; glycoprotein; hormone receptor; 
phosphoprotein; transmembrane protein 

F; 40-67/Domain: transmembrane #status predicted <TM1> 
F;78-104/Domain: transmembrane #status predicted <TM2> 
F; 116-137/Domain: transmembrane #status predicted <TM3> 
F; 158-178/Domain: transmembrane #status predicted <TM4> 
F;208-234/Domain: transmembrane #status predicted <TM5> 
F; 314-332/Domain: transmembrane ftstatus predicted <TM6> 
F; 350-369/Domain: transmembrane #status predicted <TM7> 

F;10,24,190,299/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F;245,249,260,290/Binding site: phosphate (Ser) (covalent) (by protein kinase C) 
#status predicted 

F;256/Binding site: phosphate (Ser) (covalent) (by protein kinase A) #status 
predicted 

Query Match 19.3%; Score 428.5; DB 2; Length 428; 

Best Local Similarity 29.2%; Pred. No. 2.2e-28; 

Matches 125; Conservative 82; Mismatches 180; Indels 41; Gaps 13; 
Qy 16 GSREPSPVPPDYEDEFLRYLWRDYLYPKQYEW VLIT^YVAVFWALVGNTLVCLAVW 72 



Db 11 GSNITPPCELGLENETLFCL— DQPRPSK-EWQPAVQILLYSLIFLLSVLGNTLVITVLI 67 



Qy 



73 RNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSV 132 



Db 


68 


Qy 


133 


Db 


128 


Qy 


191 


Db 


185 


Qy 


244 


Db 


245 


Qy 


288 


Db 


303 


Qy 


348 


Db 


360 


Qy 


408 


Db 


418 



II llllll |:::|:::|::: I : I : I :::::: I I I : II I MM 
RNKRMRTVTNIFLLSLAVSDLMLCLFCMPFNLIPNLLKDFIFGSAVCKTTTYFMGTSVSV 127 



I I : I : I : I M II : 1 I I I : I III I : I 



243 



: I I : I : I I I 



-GRQIPGTTSA LVRNWKRPSD-QLGDLEQGLSGEPQPRARAFLAEV 287 

: I I I I : : : : I Mill I I : : 



IIM:::M IMMI I : ::M M 



I M Mill I:: Ml llllll III: II II I 



I M I 



RESULT 6 
A42685 

cholecystokinin receptor type A - rat 

C; Species: Rattus norvegicus (Norway rat) 

C;Date: 04-Mar-1993 #sequence_revision 18-Nov-1994 #text_change 09-Jul-2004 

C;Accession: A42685; JC4225; PC2213 

R;Wank, S.A. ; Harkins, R. ; Jensen, R.T. ; Shapira, H.; de Weerth^ A.; Slattery, 
T. 

Proc. Natl. Acad. Sci. U.S.A. 89, 3125-3129, 1992 

A; Title: Purification, molecular cloning, and functional expression of the 

cholecystokinin receptor from rat pancreas. 

A; Reference number: A42685; MUID: 92212981; PMID: 1313582 

A;Accession: A42685 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-444 <WAN> 

A; Cross-references: UNIPROT: P30551; GB:M88096; NID:g203383; PIDN: AAA40899 . 1; 
PID:g203384 

A; Experimental source: pancreas 

A;Note: sequence extracted f rom NCBI backbone (NCBIN : 93814 , NCBIP:93815) 

R;Takata, Y. ; Takiguchi, S.; Funakoshi, A.; Kono, A. 

Biochem. Biophys . Res. Commun. 213, 958-966, 1995 

A; Title: Gene structure of rat cholecystokinin type-A receptor. 

A; Reference number: JC4225; MUID: 95382845; PMID:7654260 

A; Accession: JC4225 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-223 <TAK> 

A; Cross-references: DDBJ:D50608; NID:gll00752 



R;Mantaniadiotis, T.; Baldwin, G.S. 

Biochem. Biophys . Res. Commun. 201, 1382-1389, 1994 

A; Title: The seventh transmembrane domain of gastrin/CCK receptors contributes 
to non-peptide antagonist binding. 

A;Reference number: PC2213; MUID: 94296413; PMID:8024583 . 
A; Accession: PC2213 
A; Status : preliminary 
A; Molecule type: protein 
A; Residues: 366-389 <MAN> 

C; Comment: This G-protein-coupled receptor is present in the gastrointestinal 
system, vagus nerve and localized areas of the central nervous system. It 
mediates pancreatic growth and enzyme secretion, smooth muscle contraction of 
the gallbladder and stomach. It is capable of activating phospholipase C and 
stimulating trunsduction by increasing levels of diacylglycerol, inositol 
phosphate, and inducing the subsequent release of intracellular calcium. 
C; Genetics: 
A; Gene: CCKAR 

A;Introns: 53/1; 137/1; 224/2; 267/1 
C;Superfamily: neurokinin 1 receptor 

C;Keywords: G protein-coupled receptor; hormone receptor; transmembrane protein 

F; 57-82/Domain: transmembrane #status predicted <TM1> 
F; 93-119/Domain: transmembrane #status predicted <TM2> 
F; 131-151/Domain: transmembrane #status predicted <TM3> 
F;173-193/Domain: transmembrane #status predicted <TM4> 
F;225-249/Domain: transmembrane #status predicted <TM5> 
F; 330-348/Domain: transmembrane ftstatus predicted <TM6> 
F; 366-389/Domain: transmembrane #status predicted <TM7> 

Query Match 19.0%; Score 421.5; DB 2; Length 444; 

Best Local Similarity 28.1%; Pred. No. 9.1e-28; 

Matches 120; Conservative 83; Mismatches 171; Indels 53; Gaps 12; 



Qy 


16 


GSREPSPVPPDYEDEFLRYLWRDYLYPKQYEW VLIAAYVAVFWALVGNTLVCLAVW 

II 1 |:| 1 1 1 1 : II : 1 1 :|:::::|||ll 
GSNITPPCELGLENETLFCL — DQPQPSK-EWQSALQILLYSIIFLLSVLGNTLVITVLI 


72 


Db 


26 


82 


Qy 


73 


RNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSV 

II II 1 1 1 1 1: 1 :: : 1 : : : 1 : 1 : 1 :::::: II 1 : II 1 1 1 1 1 
RNKRMRTVTNI FLLSLAVSDLMLCLFCMPFNLI PNLLKDFI FGSAVCKTTTYFMGTSVSV 


132 


Db 


83 


142 


Qy 


133 


AVLTLSFIALDRWYAICHPLLFK — STARRARGSILGIWAVSLAIMVPQAAVMECSSVLP 

: 1 I:|:|: III II : 1 1 1 1 :l III |:::1 
STFNLVAISLERYGAICRPLQSRVWQTKSHALKVIAATWCLSFTIMTPYPIY SNLVP 


190 


Db 


143 


199 


Qy 


191 


ELANRTRLFSVCDERWADDLYPKI YHSCFFIVTYLAPLGLMAMAYFQI FRKLW-GRQI PG 


249 


Db 


200 


1 : : : 1 I : : : : : : 1 1 : 1 : 1 1 1 : 1 : 1 : 
FTKNNNQTANMCRFLLPSDAMQQSWQTFLLLILFLLPGIVMWAYGLISLELYQGIKFDA 


259 


Qy 


250 


TTSALVRNWKRPSD QLGDLEQGLSGEPQPRARAFLAE 

: 1:11 :| 1 1 1 1 |: : 
SQKKSAKE-KKPSTGSSTRYEDSDGCYLQKSRPPRKLELQQLSSGSGGSRLNRIRS-SSS 


286 


Db 


260 


317 


Qy 


287 


VKQMRTVRRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWL 

: 1 : : : : M : 1 : : : : 1 1 1 : : 1 1 1 : : = = = = '\ 1 
7J\N LI AKKRVI RMLI VI WL FFLCWMP I FS ANAWRAYDT V S AEKHLS GT PISFILLL 


346 


Db 


318 


374 


Qy 


347 


VYANSAANPIIYNFLSGKFREQFK/^AFSCCLPGLGPCG ^ SLKAPSPRSS 


394 



1:1 Mill I:: :|| I I I II I III II 
Db 375 SYTSSCVNPIIYCFMNKRFRLGFMATFPCC--PNPGPPGVRGEVGEEEDGRTIRALLSRYS 433 

Qy 395 ASHKSLS 401 

II II 

Db 434 YSHMSTS 440 



RESULT 7 

S23510 

neurokinin 1 receptor - guinea pig 

N; Alternate names: NK-1 receptor; substance P receptor 

C; Species: Cavia porcellus (guinea pig) 

C;Date: 07-Apr-1994 #sequence_revision 07-Apr-1994 #text_change 09-Jul-2004 
C;Accession: S23510; S19198 

R;Gorbulev, V.; Akhundova, A.; Luzius, H. ; Fahrenholz, F. 
Biochim. Biophys . Acta 1131^ 99-102, 1992 

A; Title: Molecular cloning of substance P receptor cDNA from guinea-pig uterus. 
A; Reference number: S23510; MUID: 92256498 ; PMID: 1374648 
A;Accession: S23510 ^ 
A; Molecule type: mRNA 
A; Residues: 1-407 <GOR> 

A; Cross-references: UNIPROT : P30547 ; EMBL:X64323; NID:g49565; PIDN: CAA45608 . 1; 
PID:g49566 

C; Super family: neurokinin 1 receptor 

C;Keywords: G protein-coupled receptor; glycoprotein; transmembrane protein 

F; 32-55/Domain: transmembrane #status predicted <TM1> 

F; 69-89/Domain: transmembrane #status predicted <TM2> 

F; 117-128/Domain: transmembrane #status predicted <TM3> 

F; 14 9-169/Domain: transmembrane #status predicted <TM4> 

F; 196-217/Domain: transmembrane #status predicted <TM5> 

F;250-280/Domain: transmembrane #status predicted <TM6> 

F;289-308/Domain: transmembrane #status predicted <TM7> 

Query Match 18.9%; Score 419; DB 2; Length 407; 

Best Local Similarity 27.6%; Pred. No. 1.3e-27; 

Matches 110; Conservative 71; Mismatches 145; Indels 72; Gaps 11; 

Qy 42 PKQY EWVLI AAYVAVFWALVGNTLVCLAVWRNHHMRTVTNYFIVNLSIADVLV 95 

II: I :: III : I : : I I I : I : : I I I I I I I I : I I I : I : : 
Db 22 PNQFVQPAWQIVLWAAAYTVIVWSWGNVVVMWIILAHKRMRTVTNYFLVNLAFAEASM 81 

Qy 96 TAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFI7VLDRWYAICHPLLFK 155 

I : : I : I II : : I :::::: | | I : I I I I I : 

Db 82 AAFNTWNFTYAVHNEWYYGLFYCKFHNFFPIAAVFASIYSMTAVAFDRYMAIIHPLQPR 141 

Qy 156 STARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERW ADDLYP 212 

:| : I II ::| : II I : I II I I :l 

Db 142 LSATATKWICVIWVLALLLAFPQGYY STTETMPGRV VCMIEWPSHPDKIYE 193 

Qy 213 KIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGL 272 

hill :: I II :: II : II :lll :l 
Db 194 KVYHICVTVLIYFLPLLVIGYAYTWGITLWASEIPGDSSDRYH 237 

Qy . 273 SGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQASD 332 

: I : I : 1 I I I : : I I : I I : I : I I : : I : 
Db 238 . EQVSAKRKWKMMIVWCTFAICWLPFHIFFLLPYINPDLYLKKF 282 



Qy 



333 REAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPR 392 



Db 



283 IQQWIAIM---WIJ\MSSTMYNPIIYCCLNDRFRLGFKHAFRCC-PFI 326 



Qy 



393 SSASHKSLSL QSRCSVSKISEHWLTSVTTVL 424 



Db 



327 SAADYEGLEMKSTRYFQTQGSVYKVSR— LETTISTW 362 



RESULT 8 
A34357 

neurokinin 1 receptor - rat 

N;Alternate names: NK-1 receptor; substance P receptor 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 22-Jan-1993 #sequence_revision 22-Jan-1993 #text_change 09-Jul-2004 
C;Accession: A38692; A34357; A40089 
R;Hershey^ A.D.; Dykema, P.E.; Krause, J.E. 
J. Biol. Chem. 266, 4366-4374, 1991 

A;Title: Organization, structure, and expression of the gene encoding the rat 
substance P receptor. 

A; Reference number: A38692; MUID: 91154239; PMID: 1705552 
A;Accession: A38692 
A;Molecule type: DNA 
A; Residues: 1-407 <HER> 

A;Cross-references: UNIPROT : P14600; GB:M34751 

R;Yokota, Y. ; Sasai, Y. ; Tanaka, K. ; Fujiwara, T.; Tsuchida, K. ; Shigemoto, R. ; 
Kakizuka, A.; Ohkubo, H.; Nakanishi, S. 
J. Biol. Chem. 264, 17649-17652, 1989 

A;Title: Molecular characterization of a functional cDNA for rat substance P 
receptor. 

A;Reference number: A34357; MUID: 90036822; PMID:2478537 
A; Accession: A34357 
A;Molecule type: mRNA 
A; Residues: 1-407 <YOK> 

A;Cross-references: GB:J05097; NID:g207051; PIDN: AAA42175 . 1; PID:g207052 
R;Hershey, A.D.; Krause, J.E. 
Science 247, 958-962, 1990 

A; Title: Molecular characterization of a functional cDNA encoding the rat 
substance P receptor. 

A;Reference number: A40089; MUID: 90161991; PMID:2154852 

A; Accession: A4008 9 
A; Status: preliminary 
A;Molecule type: mRNA 

A;Residues: 1-74, ' R' , 76-212, • A* , 214-407 <HE2> 

A; Cross-ref erences : GB:M31477 

C; Super family : neurokinin 1 receptor 

C;Keywords: G protein-coupled receptor; glycoprotein; membrane protein 

Query Match 18.8%; Score 418; DB 2; Length 407; 

Best Local Similarity 27.6%; Pred. No. 1.6e-27; 

Matches 112; Conservative 69; Mismatches 149; Indels 76; Gaps 8; 

Qy 29 DEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNL 88 

::|:: |: |: III : | ::||| :| : : IIIIIMI:|II 

Db 23 NQFVQPTWQIVLW AAAYTVIWTSWGNVWIWIILAHKRMRTVTNYFLVNL 74 



Qy 89 SLADVLWAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAI 148 

: I: : I : : I :| II : :::::::: I I I : || 

Db 75 AFAEACMAAFNTVWFTYAWNVWYYGLFYCKFHNFFPIAALFASIYSMTAVAFDRYMAI 134 

Qy 149 CHPLLFKSTARRARGSILGIWAVSLAIMVPQA AVMECSSVLPELANRTRLF 199 

III : : I : I I I I : I I : : I II III 
Db 135 IHPLQPRLSATATKWIFVIlflmJVLLLAFPQGYYSTTETMPSRWCMIEWPEHPNRT 191 

Qy 200 SVCDERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWK 259 

I I I I I :: I I I :: I I : II : I I I : I 
Db 192 YEKAYHICVTVLIYFLPLLVIGYAYTWGITLWASEIPGDSSDRYH 237 

Qy 260 RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMIJWVLLVFALCYLPISVLW 319 

: I : I : II II : : I I : I I : I : I I I : 
Db 238 EQVSAKRKWKMMIVWCTFAICWLPFHVFFL 269 

Qy 320 LKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCC-LP 378 

I : ^ : I I III : : : I I II I I : : I I I I I I I I 

Db 270 LPYINPDLYLKKFIQQVYLA SMWLAMSSTMYNPIIYCCLNDRFRLGFKHAFRCCPFI 326 

Qy 379 GLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSVTTVL 424 

I I : II II : : M I : I : I : : : I I : 

Db 327 SAGDYEGLEMKSTR YLQTQSSVYKVSR— LETTISTW 362 



RESULT 9 
S17783 

tachykinin receptor homolog DTKR - fruit fly (Drosophila melanogaster ) 
C; Species: Drosophila melanogaster 

C;Date: 22-Nov-1996 #sequence_revision 22-Nov-1996 #text_change 09-Jul-2004 
C;Accession: S17783 

R;Li, X.J,; Wolfgang, W. ; Wu, Y.N.; North, R.A. ; Forte, M. 
EMBO J. 10, 3221-3229, 1991 

A; Title: Cloning, heterologous expression and developmental regulation of a 

Drosophila receptor for tachykinin-like peptides . 

A; Reference number: S17783; MUID: 92007772 ; PMID: 1717263 

A;Accession: S17783 

A;Molecule type: mRNA 

A; Residues: 1-519 <LIX> 

A; Cross-references: UNIPROT : P30975; EMBL:X62711; NID:g8505; PIDN : CAA44595 . 1; 
PID:g8506 

A; Note: the sequence from Fig. 2 is inconsistent with that from Fig. 1 in 

lacking 481-Gly 
C; Genetics : 

A; Gene: FlyBase : Takr99D 

A; Cross-references : FlyBase : FBgn0004622 
C; Super family : neurokinin 1 receptor 

C;Keywords: G protein-coupled receptor; neurotransmitter receptor; transmembrane 
protein 

Query Match 18.8%; Score 417.5; DB 2; Length 519; 

Best Local Similarity 28.1%; Pred. No. 2.3e-27; 

Matches 113; Conservative 62; Mismatches 154; Indels 73; Gaps 10; 

Qy 3 PSATPGAQMGVPPGSREPS PVPPDYED EFLRYLWRDYLY 41 

II II I I I I I I I : I I I 

Db 45 PCRTLARSSPYPPVSFNHSQTLSTDQPAVGDVEDAAEDAAASMETGSFAFWPWWRQVL- 103 



Qy .42 PKQYEWLIT^WAVFWALVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLWM 101 

I : : : : : I I I I : I I I I I.I I I I I I I I I I : I I : I : : : : 

Db 104 WSIL — FGGMVIVATGGNLIWWIVMTTKRMRTVTNYFIVNLSIADAMVSSLNVT 156 

Qy 102 ASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRA 161 

: : III II: :: :|: :| II |::||: II II : : I 
Db 157 FNYYYMLDSDWPFGEFYCKLSQFIAMLSICASVFTIJ^SIDRYVAIIRPLQPRMSKRCN 216 

Qy 162 RGSILGIWAVSLAIMVPQAAVMECSSV-LPELANRTRLFSVCDERWAD DLYPKIY 215 

I I I I I : I : I : III II II : I 

Db 217 LAIAAVIWLASTLISCPMMIIYRTEEVPVRGLSNRT VCYPEWPDGPTNHSTMESLY 272 

Qy 216 HSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGE 275 

: I : I 1 I : I : I : : : I I I : II 
Db 273 NILIIILTYFLPIVSMTVTYSRVGIELWGSK TIGE 307 

Qy 276 PQPRARAFIlAEWQMRARRKTAKMIJy[VVLLVFALCYLPISVL^^^ 335 

II :|: :|::|: I 1 : : I I : I : I I : I : I I : :| : 

Db 308 CTPR QVENVRSKRRVVKMMIVVVLIFAICWLPFHSYFIITSCYPAITEAPFIQE 361 

Qy 336 VYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCCL 377 

: I : I I : I I M I I I : : : : I I II I II 

Db 362 LYLAI YWLAMSNSMYNPIIYCWMNSRFRYGFKMVFRWCL 400 



RESULT 10 
JQ1274 

neurokinin 1 receptor - human 

N;Alternate names: NK-1 receptor; substance P receptor 
C; Species: Homo sapiens (man) 

C;Date: 20-Apr-2000 #sequence_revision 20-Apr-2000 #text_change 09-Jul-2004 
C;Accession: A41134; JQ1274; JH0478; S21188 

R;Gerard^ N.P.; Garraway, L.A. ; Eddy Jr., R.L.; Shows, T.B.; lijima, H.; Paquet, 
J.L.; Gerard, C. . 

Biochemistry 30, 10640-10646, 1991 

A; Title: Hiiman substance P receptor (NK-1) : organization of the gene, chromosome 
localization, and functional expression of cDNA clones. 
A; Reference number: A41134; MUID: 92031510; PMID: 1657150 
A; Access ion: A41134 
A; Molecule type: DNA 

A; Residues: 1-328 G 329-332 , 334-407 <GER> 

A; Cross-references: UNIPROT : P25103 ; GB:M76675; NID:gl89231 

A; Note: in the authors' translation 333-Gly is shown before residue 329 and, 
consequently, residues 329-332 are displaced one codon to the right 
R;Takeda, Y.; Chou, K.B.; Takeda, J.; Sachais, B.S.; Krause, J.E. 
Biochem. Biophys . Res. Commun. 179, 1232-1240, 1991 

A; Title: Molecular cloning, structural characterization and functional 

expression of the human substance P receptor. 

A; Reference number: JQ1274; MUID: 92028856; PMID: 1718267 

A;Accession: JQ1274 

A;Molecule type: mRNA 

A; Residues: 1-407 <TAK1> 

A; Cross-references: GB:M74290; NID:g338612; PIDN: AAA60601 . 1; PID:g338613 
R; Hopkins, B. ; Powell, S.J.; Banks, P.; Briggs, I.; Graham, A. 
Biochem. Biophys. Res. Commun. 180, 1110-1117, 1991 

A; Title: Isolation and characterization of the human lung NK-1 receptor cDNA. 



A; Reference number: JH0478; MUID: 92062052; PMID: 1659396 
A;Accession: JH0478 
A;Molecule type: mRNA 
A; Residues: 1-407 <HOP> 

A; Cross-references: GB:S62045; NID:g237994; PIDN: AAB20168 . 1; PID:g237995 
A; Experimental source: lung 

A;Note: the authors translated the codon CAA for residue 31 as Glu 
R;Takahashi, K. ; Tanaka, A.; Kara, M. ; Nakanishi, S. 
Eur. J. Biochem. 204, 1025-1033, 1992 

A; Title: The primary structure and gene organization of human substance P and 
neuromedin K receptors . 

A; Reference number: S21188; MUID: 92201186; PMID: 1312928 

A; Accession: S21188 
A; Status: preliminary- 
A;Molecule type: DNA 
A; Residues: 1-407 <TAK2> 

A;Cross-references: GB:X65177; NID:g36636; PIDN:CAA46292 . 1; PID:g825721 

C;Comment: The endogenous ligand of this receptor is neurokinin 1 (substance P) , 

one of the peptides in the mammalian tachykinin system. 

C; Genetics : 

A; Gene: GDB:TAC1R 

A/Cross-references: GDB: 128977; OMIM: 162323 

A;Map position: 2pter-2qter 

C; Super family: neurokinin 1 receptor 

C;Keywords: G protein-coupled receptor; glycoprotein; hormone receptor; 

transmembrane protein 

F;32-58/Domain: transmembrane #status predicted <TM1> 

F;69-92/Domain: transmembrane #status predicted <TM2> 

F; 110-128/Domain: transmembrane #status predicted <TM3> 

F; 149-168/Domain: transmembrane #status predicted <TM4> 

F; 195-221/Domain: transmembrane #status predicted <TM5> 

F;249-273/Domain: transmembrane #status predicted <TM6> 

F;286-308/Domain: transmembrane #status predicted <TM7> 

F;14, 18/Binding site: carbohydrate (Asn) (covalent) #status predicted 

F;105-180/Disulfide bonds: ttstatus predicted 

Query Match 18.8%; Score 417; DB 1; Length 407; 

Best Local Similarity 28.0%; Pred. No. 2e-27; 

Matches 110; Conservative 69; Mismatches 152; Indels 62; Gaps 10; 



Qy 


42 


PKQY EWVLI AAYVAVFWALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLV 

II: 1 : : III : | : : I jj : 1 : : 1 1 1 II 1 1 1 : 1 II : 1 : : 
PNQFVQPAWQI VLWAAAYTVI VVT S VVGIWVVMWI I LAHKRMRT WN YFLW^ 


95 


Db 


22 


81 


Qy 


96 


TAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFK 

1 : : 1 : 1 II : : 1 :::::: 1 II : II 1 II : 
AAFNTWNFTYAVHNEWYYGLFYCKFHNFFPIAAVFASIYSMTAVAFDRYMAIIHPLQPR 


155 


Db 


82 


141 


Qy 


156 


STARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERWAD DLYP 

: 1 : 1 1 1 : : 1 : 1 1 1 : : 1 II 1 : : 1 
LSATATKWICVIWVLALLLAFPQGYY STTETMPSRV VCMIEWPEHPNKIYE 


212 


Db 


142 


193 


Qy 


213 


KIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGL 
1 : 1 1 1 : : 1 1 1 : : 1 1 : 1 1 : I 1 1 : 1 

KVYHI CVTVLI YFLPLLVI GYAYTWGI TLWAS EI PGDS S DRYH 


272 


Db 


194 


237 


Qy 


273 


SGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRWG 


332 



: I : I : I I I I :: I I : I I : I : I I : : I : 
Db 238 EQVSAKRKWKMMIVWCTFAICWLPFHIFFLLPYINPDLYLKKF 282 

Qy 333 REAWACFTFSHWLW7\NS/iANPIIYNFLSGKFREQFKAAFSCC-LPGLGPCGSLKAPS 391 

: I I II : : : I I I I I I : : M II I I I I I I : I 

Db 283 IQQVYLAIM WLAMSSTMYNPIIYCCLNDRFRLGFKHAFRCCPFISAGDYEGLEMKST 339 

Qy 392 RSSASHKSLSLQSRCSVSKISEHWLTSVTTVL 424 

I II:: II I:| : |:::||: 

Db 340 R YLQTQGSVYKVSR — LETTISTW 362 



RESULT 11 
S20304 

neurokinin 1 receptor - mouse 

N;J\lternate names: NK-1 receptor; substance P receptor 
C; Species: Mus musculus (house mouse) 

C;Date: 22-Nov-1993 #sequence_revision lO-Nov-1995 #text_change 09-Jul-2004 
C;Accession: 320304; 156216; 173044 

R;Sundelin^ J.B.; Prowedini, D.M.; Wahlestedt, C.R.; Laurell, H. ; Pohl, J.S.; 

Peterson, P. A. 

Eur. J. Biochem. 203, 625-631, 1992 

A; Title: Molecular cloning of the murine substance K and substance P receptor 
genes . 

A; Reference number: S20303; MUID: 92137253; PMID: 1370937 
A; Accession: S20304 
A; Status: preliminary 
A;Molecule type: mRNA 

A; Residues: 1-4 07 <SUN> > 

A; Cross-references: UNIPROT : P30548 ; GB:X62934; NID:g54206; PIDN : CAA44707 . 1; 
PID:g54207 

R;Cook, G.A. ; Elliott, D.; Metwali, A.; Blum, A.M.; Sandor, M. ; Lynch, R. ; 
Weinstock, J.V. 

J. Immunol. 152, 1830-1835, 1994 

A; Title: Molecular evidence that granuloma T lymphocytes in murine 
schistosomiasis mansoni express an authentic substance P (NK-1) receptor. 
A; Reference number: 156216; MUID: 94165478 ; PMID: 8120392 

A;Accession: 156216 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: mRNA 

A; Residues: 63-290 <C001> 

A; Cross-references: GB:L27826; NID:g450286; PIDN:AAA17891 . 1 ; PID:g480776 
A; Experimental source: tissue brain 
A; Accession: 173044 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: mRNA 

A; Residues: 63-290 <C002> 

A; Cross-references: GB:L27828; NID:g450290; PIDN:AAA17892 . 1; PID:g480778 
A; Experimental source: tissue granuloma 
C;Superf amily : neurokinin 1 receptor 

C; Keywords: G protein-coupled receptor; transmembrane protein 

Query Match 18.8%; Score 417; DB 2; Length 407; 

Best Local Similarity 27.3%; Pred. No. 2e-27; 

Matches 111; Conservative 70; Mismatches 149; Indels 76; Gaps 8; 



Qy 29 DEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNL 88 



::|:: |: |: III : I ::||| :l : : lllllllhlll 

Db 23 NQFVQPTWQIVLW AAAYTVIWTSWGNVWIWIILAHKRMRTVTNYFLVNL 74 

Qy 89 SLADVLWAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAI 148 

: I : : I : : I : I II : :::::::: I I I : II 

Db 75 AFAEACMAAFNTWNFTYAVHNVWYYGLFYCKFHNFFPIAALFASIYSMTAVAFDRYMAI 134 

Qy 149 CHPLLFKSTTVRRTU^GSILGIWAVSLAIMVPQA-: AVMECSSVLPELANRTRLF 199 

III : : I : I I I :: I : II : : I M I I I 
Db 135 IHPLQPRLSATATKWIFVIWVLALLLAFPQGYYSTTETMPSRWCMIEWPEHPNRT 191 

Qy 200 SVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWK 259 

I I II I I II M : II :|ll :| 
Db 192 YEKAYHICVTVLIYFLPLLVIGYAYTWGITLWASEIPGDSSDRYH 237 

Qy 260 RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNV 319 

: I : I : II I I :: I I : I I : I : II : : 
Db 238 EQVSAKRKWKMMIVWCTFAICWLPFHIFFL 269 

Qy 320 LKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFK7\AFSCC-LP 378 

I : : II I I I : : : II II I I : : M I I II I I 

Db 270 LPYINPDLYLKKFIQQVYLA SMWLAMSSTMYNPIIYCCLNDRFRLGFKHAFRCCPFI 326 

Qy 379 GLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSVTTVL 424 

I I : II I I : : I I I : I : I : : : I I : 

Db 327 SAGDYEGLEMKSTR YLQTQSSVYKVSR-— LETT 1ST W 362 



RESULT 12 
151898 

cholecystokinin A receptor - guinea pig 
C; Species: Cavia porcellus (guinea pig) 

C;Date: 04-Sep-1997 #sequence_revision 04-Sep-1997 #text_change 09-Jul-2004 

C;Accession: 151898 

R;De Weerth, A.; Pisegna, J.R.; Wank, S.A. 
Am. J. Physiol. 265, G1116-G1121, 1993 

A; Title: Guinea pig gallbladder and pancreas possess identical CCK-A receptor 

subtypes: receptor cloning and expression. 

A; Reference number: 151898; MUID: 94106629; PMID:7916580 

A; Accession: 151898 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-430 <RES> 

A; Cross-references: UNIPROT: Q63931; GB:S68242; NID:g544723; PIDN: AAB29504 . 1; 

PID:g544724 

C; Super family: neurokinin 1 receptor 

Query Match 18.8%; Score 417; DB 2; Length 430; 

Best Local Similarity 28.4%; Pred. No. 2.1e-27; 

Matches 122; Conservative 81; Mismatches 169; Indels 58; Gaps 13; 

Qy 16 GSREPSPVPPDYEDEFLRYLWRDYLYPK-QYEW VLIAAYVAVFWALVGNTLVCLAV 71 

II I : I : I I II I : II M i : I : : : :: I I I I I : 

Db 11 GSNITSACELGFENETLFCLDR PRPSKEWQPAVQILLYSLIFLLSVLGNTLVITVL 66 

Qy 72 WRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVS 131 

II I I I I I I I : : : I : : : I : : : hi : I : = ' --W I ^ I I I I I I 



Db 



67 IRNKRMRTVTNIFLLSLAVSDLMLCLFCMPFNLIPSLLKDFIFGSAVCKTTTYFMGTSVS 126 



Qy 132 VAVLTLSFIALDRWYAICHPLLFK— STARRARGSILGIWAVSLAIMVPQAAVMECSSVL 189 

I: I |:|:|: III II : I I I I :| II I h:: 
Db 127 VSTFNLVAISLERYGAICKPLQSRVWQTKSHALKVIAATWCLSFTIMTPYPIY SNLV 183 

Qy 190 PELANRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGmAMA^ 248 

I I : ::| :|: : :|: :: :| I :| :|l I :|: I : 

Db 184 PFTKNNNQTGNMCRFLLPNDVMQQTWHTFLLLILFLIPGIVMMVAYGLISLELYQGIKFD 243 

Qy 249 GTTSALVRNWKRPSDQLGDLEQG LSGEPQPR7VRAFLAEVKQ 289 

: I : I :| I II l::l 
Db 244 AIQKKSAKERKTSTGSSGPMEDSDGCYLQKSRHPRK LELRQLSPSSSGSNRINRIR 299 

Qy 290 MRARRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTF 342 

: j: : : : || : I : : : : I I I : : I I I : : : : : I =1 

Db 300 SSSSTANLIVIAKKRVIRMLIVTVVLFFLCWMPIFSANAWRAYDTV SAERHLSGTPISF 356 

Qy 343 SHWLVYANSAANPIIYNFLSGKFREQFKAAFSCC LPGL-GPCG SLKAPSP 391 

I I : I I II II I :: : I I I I I I I I I : I . I : I 

Db 357 ILLLSYTSSCVNPIIYCFMNKRFRLGFMATFPCCPNPGTPGVRGEMGEEEEGRTTGASLS 416 

Qy 392 RSSASHKSLS 401 

I I I i I I 
Db 417 RYSYSHMSTS 426 



RESULT 13 

JQ1517 

neurokinin 3 receptor - human 

N;Alternate names: neuromedin K receptor; NK-3 receptor 
C;Species: Homo sapiens (man) 

C;Date: 20-Apr-2000 #sequence_revision 20-Apr-2000 #text_change 09-Jul-2004 

C;Accession: JQ1517; S20435; S21237 

R;Huang, R.R.C.; Cheung^ A.H.; Mazina, K.E.; Strader, C.D.; Fong, T.M. 
Biochem. Biophys. Res. Commun. 184, 966-972, 1992 

A; Title: cDNA sequence and heterologous expression of the human neurokinin-3 
receptor . 

A; Reference number: JQ1517; MUID: 92246993; PMID: 1374246 
A;Accession: JQ1517 

A; Status: nucleic acid sequence not shown 

A; Molecule type: mRNA 
A; Residues: 1-465 <HUA> 

A; Cross-references: UNIPROT: P29371; GB:M89473; NID:gl89223; PIDN: AAA36366. 1; 
PID:gl89224 

A; Experimental source: brain 

R;Buell, G. ; Schulz, M.F.; Arkinstall, S.J.; Maury, K. ; Missotten, M. ; Adami, 
N.; Talabot, F.; Kawashima, E. 
FEBS Lett. 299, 90-95, 1992 

A; Title: Molecular characterisation, expression and localisation of human 
neurokinin-3 receptor. 

A;Reference number: S20435; MUID: 92183914; PMID: 1312036 

A;Accession: S20435 
A; Status: preliminary 
A; Molecule type: DNA 

A;Residues: 1-2, * I 4-62, ' R* , 64-465 <BUE> 

A; Cross-references: GB:S86392; NID:g246908; PIDN: AAB2 1706. 1; PID:g246909 



R;Takahashi, K. ; Tanaka, A.; Hara, M. ; Nakanishi, S. 
Eur. J. Biochem. 204, 1025-1033, 1992 

A; Title: The primary structure and gene organization of human substance P and 
neuromedin K receptors. 

A;Reference number: S21188; MUID: 92201186; PMID:1312928 

A;Accession: S21237 

A; Status: preliminary. 

A;Molecule type: DNA 

A;Residues: 1-438 F 440-465 <TAK> 

A; Cross-references: GB:X65172; NID:g35022; PIDN : CAA46291 . 1 ; PID:g825695 

C; Comment: The endogenous ligand of this receptor is neurokinin 3 (neuromedin 

K) , one of the peptides in the mammalian tachykinin system. 

C; Genetics: 

A; Gene: GDB:TACR3 

A; Cross-references : GDB: 9599126 

C;Superf amily: neurokinin 1 receptor 

C; Keywords: G protein-coupled receptor; glycoprotein; hormone receptor; 

transmembrane protein 

F; 83-111/Domain : transmembrane #status predicted <TM1> 
F; 122-147/Domain: transmembrane #status predicted <TM2> 
F; 160-181/Domain: transmembrane #status predicted <TM3> 
F;202-221/Domain: transmembrane #status predicted <TM4> 
F;247-272/Domain: transmembrane #status predicted <TM5> 
F; 300-321/Domain: transmembrane #status predicted <TM6> 
F;333-355/Domain: transmembrane #status predicted <TM7> 

F;23, 50, 73/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F; 158-233/Disulf ide bonds: #status predicted 



Query Match 18.7%; Score 416; DB 1; Length 465; 

Best Local Similarity 28.7%; Pred. No. 2.8e-27; 

Matches 108; Conservative 70; Mismatches 142; Indels 56; Gaps 10; 



Qy 


4 


SATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVG 


63 


Db 


52 


1 :: 1 1 : 1 : 1 1 II : : : 1 : : II 1 II II 1 II : : 1 
SSSPSA-LGLPVASPAPSQPWANLTNQFVQPSWRIAL— WSL— AYGVWAVAVLG 


102 


Qy 


64 


NTLVCIAWRNHHMRTWNYFIWLSIADVLVTAICLPASLLVDITESWLFGHALCKVIP 
1 : 1 : : 1 1 II 1 1 1 1 : 1 II :: 1 : 1 : : : III 1 : 
NLIVIWIILAHKRMRTVTNYFLVNLAFSDASMAAFNTLVNFIYALHSEWYFGANYCRFQN 


123 


Db 


103 


162 


Qy 


124 


YLQAVSVSVAVLTLS FX ALDRWYAI CHPLLFKSTARRARGS I LGI WAYS LAIMVPQAAVM 
: : 1 : : : : : 1 1 : 1 1 : 1 1 II : : 1 : 1 1 1 : : : II 
FFPITAVFASIYSMTAIAVDRYMAIIDPLKPRLSATATKIVIGSIWILAFLLAFPQCLYS 


183 


Db 


163 


222 


Qy 


184 


ECSSVLPELANRTRLFSVCDERWADDLYPK lYHSCFFIVTYLAPLGLMAMAYFQIFR 

: : 1 : 1 II : 1 : 1 : II II 1 : 1 1 1 : 1 : 1 : 
K-TKVMP GRT LCFVQWPEG— PKQHFTYHIIVIILVYCFPLLIMGITYTIVGI 


240 


Db 


223 


272 


Qy 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRT^RKTAKML 
1 1 1 : 1 1 1 1 : 1 : : 1 : 1 1 1 1 : 
TLWGGEI PGDTCDKYH EQLKAKRKWKMM 


300 


Db 


273 


301 


Qy 


301 


MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 
: : 1 : : 1 1 : 1 : 1 1 : : 1 : : : : II III : : : 1 II 1 1 
IIWMTFAICWLPYHIYFILTAIYQQLNRWKYIQQVYLA SFWLAMSSTMYNPIIYCC 


360 


Db 


302 


358 


Qy 


361 


LSGKFREQFKAAFSCC 376 





I : : I I I I I I I 
Db 359 LNKRFRAGFKRAFRWC 374 



RESULT 14 
A34916 

neurokinin 3 receptor - rat 

N; Alternate names: neuromedin K receptor; NK-3 receptor 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 22-Jan-1993 #sequence_revision 22-Jan-1993 #text_change 09-Jul-2004 
C; Accession: A34916 

R;Shigeraoto, R. ; Yokota, Y. ; Tsuchida, K. ; Nakanishi, S. 
J. Biol. Chem. 265, 623-628, 1990 

A; Title: Cloning and expression of a rat neuromedin K receptor cDNA. 
A;Reference number: A34916; MUID: 90110113; PMID:2153106 
A;Accession: A34916 . 
A; Molecule type: mRNA 
A; Residues: 1-452 <SHI> 

A; Cross-references: UNIPROT: P16177; GB:J05189; NID:g205670; PIDN: AAA41688 . 1; 
PID:g205671 

C; Super family : neurokinin 1 receptor 

C; Keywords: G protein-coupled receptor; glycoprotein; membrane protein 

Query Match 18.7%; Score 414.5; DB 2; Length 452; 

Best Local Similarity 28.7%; Pred. No. 3.6e-27; 

Matches 106; Conservative 66; Mismatches 142; Indels 55; Gaps 8; 

Qy 11 MGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLA 70 

:|:| :: II I : ::|:: II I II II I ll: II :| 

Db 45 LGLPATTQAPSQVRANLTNQFVQPSWRIAL WSL — AYGLWAVAVFGNLIVIWI 96 

Qy 71 WRNHHMRTWNYFIWLSLADVLWAICLPASLLVDITESWLFGHALCKVIPYLQAVSV 130 

: : I I I I II I I : I I I I II : : : I I I I ^ : - I 

Db 97 ILAHKRMRTVTNYFLVNLAFSDASVAAFNTLINFIYGLHSEWYFGANYCRFQNFFPITAV 156 

Qy 131 SVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLP 190 

: : : : : I I : I I : I I II : : I : I I I : : : II I : 

Db 157 FASIYSMTAIAVDRYMAIIDPLKPRLSATATKIVIGSIWILAFLLAFPQCLY SKIK 212 

Qy 191 ELANRTRLFSVCDERWADDLYPK lYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQI 247 

: II : I : I : II II I : I I I : I : I : I I I : I 

Db 213 VMPGRT LCYVQWPEG — PKQHFTYHIIVIILVYCFPLLIMGVTYTIVGITLWGGEI 266 

Qy 248 PGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFIJ^VKQMRARRKTAKMLMVVLLVF 307 

Ml : I : : I : |. I I I : : : I : : I 

Db 267 PGDTCDKYH EQLKAKRKWKMMIIVWTF 295 

Qy 308 ALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFRE 367 

I : I : I I I : I : : : : I I III : : : I I I I I I : : I I 

Db 296 AICWLPYHVYFILTAIYQQLNRWKYIQQVYLA SFWLAMSSTMYNPIIYCCLNKRFRA 352 

Qy 368 QFKAAFSCC 376 

I I I I I 
Db 353 GFKRAFRWC 361 



RESULT 15 



156595 

neurokinin 2 receptor - guinea pig 

C; Species: Cavia porcellus (guinea pig) 

C;Date: 04-Sep-1997 #sequence_revision 04-Sep-1997 #text_change 09-Jul-2004 
C;Accession: 156595 

R;Aharony, D.; Little, J.; Thomas, C; Powell, S.; Downey- Jones , M. ; Graham, A. 
J. Recept. Res. 14, 399-421, 1994 

A;Title: Isolation and characterization of neurokinin A receptor cDNAs from 

guinea-pig lung and rabbit pulmonary artery. 

A; Reference number: 156595; MUID: 95182423; PMID:7877137 

A;Accession: 156595 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-402 <RES> 

A;Cross-references: UNIPROT : Q64077 ; GB:S76253; NID:g913274; PIDN: AAB33553 . 1; 
PID:g913275 

C; Super family: neurokinin 1 receptor 

Query Match 18.6%; Score 413.5; DB 2; Length 402; 

Best Local Similarity 29.7%; Pred. No. 3.9e-27; 

Matches 113; Conservative 64; Mismatches 134; Indels 69; Gaps 12; 

Qy 47 WVL lAAWAVFWALVGNTLVCIAWRNHHMRTVTNYFIVNLSLADVLVTAICLPAS 103 

II II : I : : II : II I : : I I I I I I I I I I I I : I I I :: I : 

Db 31 WQLALWATAYLALVLVAWGNATWWIILAHQRMRTWNYFIWLALADLCMAAFNA^ 90 

Qy 104 LLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARG 163 

: I II I I :: I:: ::: ||:||: II II : :| : 

Db 91 FVYASHNIWYFGR?lFCYFQNLFPITAMFVSIYSMTAIAIDRYMAIVHPFQPRLSAPSTKA 150 

Qy 164 SILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERWADDLYPK lYHSCFF 220 

I I I I I : II : . I I I : I I : I I : I I HI 
Db 151 VIGGIWLVALALAFPQCFY STITEDEGATK CWAWPEDSRDKSLLLYHLWI 202 

Qy ' 221 IVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRA 280 

:: II II :| :|| I II I : II 
Db 203 VLIYLLPLTVMFVAYSIIGLTLWRRAV PRH 232 

Qy 281 RAFIAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACF 340 

: I I : : :: I :: I | : : : | : : | | : | : | | : : I I : I : I I 

Db 233 QAHGANLRHLQAKKKFVKTMVLVWTFAICWLPYHLYFIL -GSFQEDIY-CH 282 

Qy 341 TFSH WLVYANSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRS 393 

I II : : I I II I I : : I I I : I I I I I : I : 

Db 283 KFIQQVYIJy.FWLAMSSTMYNPIIYCCLNRRFRSGFRLAFRCC-PWVTP TEEDKL 336 

Qy 394 SASH-KSLSLQ-SRCSVSKI 411 

:| I II: :|l :| 
Db 337 ELTHTPSFSLRVNRCHTKEI 356 



Search completed: October 14, 2004, 10:56:54 
Job time : 23.5986 sees 



GenCore version 5.1.6 
Copyright (c) 1993 ~ 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



October 14, 2004, 10:55:56 



; Search time 79.0952 Seconds 

(without alignments) 

1737.280 Million cell updates/sec. 



Title: 

Perfect score: 
Sequence : 



US-10-070-532-2 

2219 

1 MEPSATPGAQMGVPPGSREP 



CSVSKISEHWLTSVTTVLP 425 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



1360919 seqs, 323318874 residues 



Total number of hits satisfying chosen parameters: 



1360919 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



Published__Applications_AA: * 



1: /cgn2_6/ptodata/2/pubpaa/US07__PUBCOMB.pep: * 

2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 

3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB . pep : * 

4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB . pep : * 

5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:* 

6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep: * 

7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:* 

8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:* 

9 : /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB . pep : * 
10: /cgn2_6/ptodata/2/pubpaa/US09B__PUBCOMB.pep:* 
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:* 
12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:* 
13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:* 
14 : /cgn2__6/ptodata/2/pubpaa/US10B_PUBCOMB.pep: * 
15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep: * 
16: /cgn2_6/ptodata/2/pubpaa/US10D_PUBCOMB.pep:* 
17: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:* 
18: /cgn2_6/ptodata/2/pubpaa/US60_NEW_.PUB.pep:* 
19 : /cgn2_6/ptodata/2 /pubpaa/US60 PUBCOMB . pep : * 



Fred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMT^IES 



Result 



% 

Query 



No. Score Match Length DB ID 



Description 
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Sequence 2, Appli 
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QQ 
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Sequence 


368, App 
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22, Appl 
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Sequence 


549, App 
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Sequence 


2, Appli 


y 


Z Uo / 
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1 
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lU 
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Sequence 


2, Appli 
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Sequence 2, Appli 


11 
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Q C 

o 0 . 


9 

1 


9 Q Q 
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A 

y 


TTQ — AQ— P9ft— R9ft — 9n 
Uo Uy OZO OjO ^lU 


Sequence 20, Appl 


Iz 


lyuz . O 


Q C 

oO . 


1 


"3 Q Q 

3o y 


1 A 
lU 


TTQ-.AQ_91 1 — 09 9^^ — 9 9 
Uo Uy ZXX DZoL/ ZJ 


Sequence 


23, Appl 


13 


1897 


o c 
OO . 


c 
O 


o Q 

3by 


13 


TTC_1A A99 — P9/1 — 4 
UblUU / /O /44 


Sequence 


4, Appli 


14 


loy4 


o c 
DO . 


4 


3 /Z 


T A 
lU 
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Uo Uy oyo Di'D D 


Sequence 


6, Appli 


lb 


ioy4 


Q C 
OO . 


4 


Q 99 

3 /Z 
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13 
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Sequence 


6, Appli 


Id 
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o4 . 


o 
Z 
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lU 
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Sequence 


4, Appli 


17 


14/4.0 


bo . 


4 


A ezr\ 
4bU 


1 A 

14 


TTC 1A API—PIA— 4i^ 
Ub— lU— UOl OlU— 4D 


Sequence 


46, Appl 


18 


1460 


c c 

bo . 


o 
o 


>i >l y| 
444 


1 A 
lU 


TTC AQ QQ9_991_1Q 

Uo— uy— y yz— Joi— ly 


Sequence 


1 A 7\ ^— 1 

19, Appl 


19 


14dO 


c c 
bo . 


o 
o 
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444 


1 A 

14 


TTC 1A AP1_P1A— 4C 
Uo— lU~U0l0lU40 


Sequence 


45, Appl 
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DO , 


Q 

o 
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14 
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O O A TV-w^^ 

37 0, App 


zl 


T /! ^A 

14 bO 
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bO . 


o 
o 
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444 


1 A 

14 
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Sequence 


19, Appl 


22 


1460 


DO . 


o 

o 


>t /I >i 
444 


1 A 

14 


TTQ T A Atf^A 9^Q 1 1 

Uo— lu— Ubu— 3by— 11 


Sequence 


11, Appl 


23 


1460 


bo . 


o 
o 


>i /I ^ 
444 


1 A 

14 


TTC 1A 190_1Q4—9 

Uo— lu— 1 /o— iy4— z 


Sequence 


2, Appli 


Z 4 


n /I A 
14 dU 


a c 
bo . 


Q 

o 
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444 


lb 


TTQ— 1 A — 9<^P — P9P — 1 Q 
Uo lU /DO 0 / O X 


Sequence 


19, Appl 


ZO 
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0 
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T A 
14 
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U o lU ZOZ / 1 / Z 


Sequence 


2, Appli 


ZD 
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b 
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lU 
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Sequence 


ool, App 


27 
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C A 
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1 
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14 
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Sequence 


04, Appl 
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Z O 
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o o 
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y 
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Sequence I 


B, Appli 


z y 
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ozz . O 


O Q 

Z 3 . 


c: 
O 


4 3U 


1 A 

14 


TTQ— 1 A — 9 9 R — 9 Z\— ^nRP 
Uo lU — ZZO — OD / /\— D O O 


Sequence 


658, App 


30 


coo c 
ozz . O 


o o 
Z3 . 


c 
0 


/I O A 

43U 


lb 


TTQ TA 91 Q RP9— R4 
Uo lU— /ly — OO / 04 


Sequence 


54, Appl 


31 


COO c 

OZZ , 0 


Z3 . 


c 
O 


A A 1 

4 41 


1 A 

14 


TTC — 1 A — 9 Q9 — 9 Q P— ft Q 0 

Uo— lu— zyz /yo— oyu 


Sequence 


890, App 


32 


C 1 A C 

519 . 5 


o o 

z3 . 


4 


/I o o 

4Z o 


y 


TTC AQ 9Q9 Q99 — 4 

Uo— uy— zyz— y /o— 4 


Sequence 


4, Appli 


33 


C A O C 

oO / . o 


o o 
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9 


/I "3 o 
43Z 


A 

y 


TTC AQ P^^ 94P7V — 9 

u 0— u y— 0 bb z 4 OA z 


Sequence : 


2, Appli 


34 
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o o 
ZZ . 


Q - 

y 


/I Q O 

43Z 


lb 
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Uo— lu / ly oo / 0 / 


Sequence 


37, Appl 
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y 
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6, Appli 
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9, Appli 
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c 
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14 
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Sequence 


53, Appl 


3o 


/I QQ ^ 

4 y y . 0 


9 9 
ZZ . 


c 
O 


^99 
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±4 


TTC-i 0-99R-Rfi7A— SI 9 


Sequence 


512, App 


39 


499.5 


22. 


5 


522 


14 


US-10-255-551-2 


Sequence 


2, Appli 


40 


499,5 


22. 


5 


522 


15 


US-10-072-012-360 


Sequence 


360, App 


41 


499.5 


22. 


5 


522 


15 


US-10-072-012-361 


Sequence 


361, App 


42 


499.5 


22. 


5 . 


522 


15 


US-10-276-774-2093 


Sequence 


2093, Ap 


43 


499.5 


22, 


5 


522 


17 


US-10-757-262-8 


Sequence 


8, Appli 


44 


499 


22. 


5 


417 


15 


US-10-072-012-358 


Sequence 


358, App 


45 


480 


21. 


6 


426 


9 


US-09-292-973-19 


Sequence 


19, Appl 



ALIGNMENTS 



RESULT 1 

US-09-961-848-2 

; Sequence 2, Application US/09961848 
; Patent No. US20020146719A1 
; GENERAL INFORMATION: 



; APPLICANT: Berglind Ran Olafsdottir 

; APPLICANT: Jeffrey Gulcher 

; TITLE OF INVENTION: HUMAN NARCOLEPSY GENE 

; FILE REFERENCE: 2345.1005-004 

; CURRENT APPLICATION NUMBER: US/ 09/961, 848 

; CURRENT FILING DATE: 2001-09-24 

; PRIOR APPLICATION NUMBER: US 09/479,128 

; PRIOR FILING DATE: 2000-01-07 

; PRIOR APPLICATION NUMBER: US 09/379,083 

; PRIOR FILING DATE: 1999-08-23 

; NUMBER OF SEQ ID NOS : 22 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 2 
; LENGTH: 425 

TYPE: PRT 
; ORGANISM: Homo Sapiens 
US-09-961-848-2 



Query Match 100.0%; Score 2218; DB 9; Length 425; 

Best Local Similarity 99.8%; Pred. No. 5.8e-204; 

Matches 424; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIII 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

IIIIIMIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIII 

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I i I N M I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M 

Db 301 MVVLLVFALCYLPISVLNAa.KRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

II Mill III II I III I I llllll III III III I III I III I IIMIH MINI III II 

Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 



Qy 421 TTVLP 425 

I II II 

Db 421 TTVLP 425 



RESULT 2 

US-09-828-538-24 



; Sequence 24, Application US/09828538 

; Patent No. US20010025031A1 

; GENERAL INFORMATION: 

; APPLICANT: Ellis, Catherine E. 

; APPLICANT: Kwok, Cheni 

; APPLIC7\NT: Bodsworth, Nicola J. 

; APPLICANT: Halsey, Wendy 

; APPLICANT: Van Horn, Stephanie 

; TITLE OF INVENTION: HFGAN72 Receptor Genomic DNA and Methods 

; TITLE OF INVENTION: of Use Thereof in Diagnostic ^^plications 

; FILE REFERENCE: GH-50038-C1 

; CURRENT APPLICATION NUMBER: US/09/828, 538 

; CURRENT FILING DATE: 2001-04-06 

PRIOR APPLICATION NUMBER: 60/088,624 
; PRIOR FILING DATE: 1998-06-08 
; PRIOR APPLICATION NUMBER: 60/093,726 
; PRIOR FILING DATE: 1998-07-22 
; PRIOR APPLICATION NUMBER: 09/328,014 
; PRIOR FILING DATE: 1999-06-08 
; NUMBER OF SEQ ID NOS: 24 

; SOFTWT^KE: FastSEQ for Windows Version 3.0 

; SEQ ID NO 24 
LENGTH: 425 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-828-538-24 



Query Match 99.8%; Score 2215; DB 9; Length 425; 

Best Local Similarity 99.8%; Pred. No. l.le-203; 

Matches 424; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy 


1 


MEP SAT PGAQMGVP PGS REP S PVP PD YEDEFLRYLWRD YLYPKQ YEWVLIAAYVAVFWA 


60 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


Qy 


61 


LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLP7VSLLVDITESWLFGHALCK 


120 


s. 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMMII 




Db 


61 


LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 


Qy 


121 


VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 1 M 1 




Db 


121 


VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 


Qy 


181 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 




Db 


181 


AVMECSSVLPEIJ\NRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


Qy 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 




Db 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 


Qy 


301 


MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 i 1 




Db 


301 


MWLLVFALCYLPI SVLNVLKRVFGMFRQAS0REAVYACFTFSHWLVYANSA7\NPI I YNF 


360 


Qy 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 


420 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 



Qy 421 TTVLP 425 

I I I I I 

Db 421 TTVLP 425 



RESULT 3 

US-09-828-538-19 

Sequence 19, Application US/09828538 
Patent No. US20010025031A1 
GENERAL INFORMATION : 
APPLICANT: Ellis, Catherine E. 
APPLICANT: Kwok, Cheni 
APPLICANT: Bodsworth, Nicola J. 
APPLICANT: Halsey, Wendy 
APPLICANT: Van Horn, Stephanie 

TITLE OF INVENTION: HFGAN72 Receptor Genomic DNA and Methods 
TITLE OF INVENTION: of Use Thereof in Diagnostic Applications 
FILE REFERENCE: GH-50038-C1 
CURRENT APPLICATION NUMBER: US/09/828,538 
CURRENT FILING DATE: 2001-04-06 
PRIOR APPLICATION NUMBER: 60/088,624 
PRIOR FILING DATE: 1998-06-08 
PRIOR APPLICATION NUMBER: 60/093,726 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: 09/328,014 
PRIOR FILING DATE: 1999-06-08 
NUMBER OF SEQ ID NOS : 24 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 19 
LENGTH: 425 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-828-538-19 

Query Match 99.8%; Score 2214; DB 9; Length 425; 

Best Local Similarity 99.5%; Pred. No. 1.4e-203; 
. Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIT^YVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I i I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n I I I I I I I I I I I I I I I I 

Db 181 AVMECSSVLPEL7\NRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 



Db 



241 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I I I I I I I I 
KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 



300 



Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I I I I I I I I 1. 1 I I I I I I I I I I I I I I I I I I I I I I I I I I M I : I I I I I I I M I I I 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLK7VPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 

Qy 421 TTVLP 425 

I I I I I 

Db 421 TTVLP 425 



RESULT 4 

US-09~211-823C-22 

Sequence 22, Application US/09211823C 
Publication No. US20030087801A1 
GENERAL INFORMATION: 
APPLIC7\NT: HAGEN, JAMES JOSEPH 
APPLICANT: TERRETT, JONATHAN ALEXANDER 
APPLICANT: UPTON, NEIL 
APPLICANT: PIPER, DAVID 
APPLICANT: SMITH, MARTIN IAN 
APPLICANT: KENNETT, GUY ANTHONY 
APPLICANT: PATEL, SARASWATI R. 

TITLE OF INVENTION: METHODS OF TREATMENT USING NOVEL LIGANDS 
TITLE OF INVENTION: OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR 
TITLE OF INVENTION; ANTAGONISTS THEREOF 
FILE REFERENCE: P50745 

CURRENT APPLICATION NUMBER: US/09/211, 823C 
CURRENT FILING DATE: 1998-12-15 
PRIOR APPLICATION NUMBER: US 60/069,459 
PRIOR FILING DATE: 1997-12-15 
PRIOR APPLICATION NUMBER: US 60/069,785 
PRIOR FILING DATE: 1997-12-16 
NUMBER OF SEQ ID NOS : 23 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 22 
LENGTH: 425 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-211-823C-22 

Query Match 99.8%; Score 2214; DB 10; Length 425; 

Best Local Similarity 99.5%; Pred. No. 1.4e-203; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLI7\AYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCIAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I i I I I I I I I I I II I M 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYMCHPLLFKSTARRARGSILGIWAVSIAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIWYIAPLGLM7\m 240 

I I I I II I I I I I II I I II I II I I I I I II I I I I I I I II I I I I I II I II M II I II II II M I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGLMAM^ 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I II II II I II I I II II I I II I I II II I I II I I II I I II I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I M M I I M I I I I M I I I 
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFK7U\FSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWXTSV 420 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 

Qy 421 TTVLP 425 

I II I I 

Db 421 TTVLP 425 



RESULT 5 

US-10-225'-567A-368 

; Sequence 368, Application US/10225567A 

; Publication No. US20030113798A1 

; GENERAL INFORMATION: 

; APPLICANT: Lifespan Biosciences 

; APPLICANT: Brown, Joseph P. 

; APPLICANT: Burmer, Glenna C. ' 

; APPLICANT: Roush, Christine L. ■ 

; TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN-COUPLED 

RECEPTORS (GPCRS) 

; FILE REFERENCE: 1920-4-4 

; CURRENT APPLICATION NUMBER: US/10/225, 567A 
; CURRENT FILING DATE: 2001-12-19 
; PRIOR APPLICATION NUMBER: 60/257,144 
; PRIOR FILING DATE: 2000-12-19 
; NUMBER OF SEQ ID NOS : 2292 
; SOFTWARE: Patentin version 3.1 
; SEQ ID NO 368 
LENGTH: 425 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-225-567A-368 

Query Match 99.8%; Score 2214; DB 14; Length 425; 

Best Local Similarity 99.5%; Pred. No. 1.4e-203; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy 

Db 



1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 



Qy 


61 


LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLVTAIGLPASLLVDITESWLFGHALCK 


120 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I.I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 




LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLWAICLPASLLVDITESW 


120 


Qy 


121 


VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


VI PYLQAVSVS VAVLTLS FX ALDRWYAI CHPLLFKSTARI^ARGS I LGIWAVS LAIMVPQA 


180 


Qy 


181 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 fli 


ax/MFr^^c-vT.ppT.AMRTRT.FSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


Qy 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 


300 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


4 -L 


KT wnRO T P fiTT S AT.VRNWKRP S DOLGDLEOGL S GE PO P RGRAFLAEVKQMRARRKT AKML 


300 


Qy 


301 


^]WLLVFALCYLPI SVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPI I YNF 


360 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


MWLLVFALCYLPI SVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPI I YNF 


360 




O D X 


T snKFRFOFKAAFSCrT.PGLGPCGSLKAPSPRSSASHKSLSLOSRCSVSKISEHWLTSV 


420 




1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 M 1 1 1 1 1 1 




Db 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 


420 


Qy 


421 


TTVLP 425 




Db 


421 


1 1 1 1 1 

TTVLP 425 





RESULT 6 

US-10-352-684A-22 

Sequence 22, Application US/10352684A 
Publication No. US20030215452A1 
GENERAL INFORMATION: 
APPLICANT: Millennium Pharmaceuticals Inc. 
APPLICANT: Carroll, Joseph M. 
APPLICANT: Healy, Aileen 
APPLICANT: Welch, Nadine S. 
APPLICANT: Kelly, Louise M. 

TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR TREATING 
TITLE OF INVENTION: HEMATOLOGICAL DISORDERS USING 131, 148, 199, 12303, 
13906, 

; TITLE OF INVENTION: 15513, 17822, 302, 5677, 194, 14393, 28059, 7366, 12212, 
; TITLE OF INVENTION: 1981, 261, 12416, 270, 1410, 137, 1871, 13051, 1847, 
1849, 

TITLE OF INVENTION: 15402, 340, 10217, 837, 1761, 8990 OR 13249 MOLECULES 
FILE REFERENCE: MPI02-019P1RNOMNIM 
CURRENT APPLICATION NUMBER: US/10/352 , 684A 
CURRENT FILING DATE: 2003-01-28 
PRIOR APPLICATION NUMBER: US 60/354,333 
PRIOR FILING DATE: 2002-02-04 
PRIOR APPLICATION NUMBER: US 60/360,258 
PRIOR FILING DATE: 2002-02-28 
PRIOR APPLICATION NUMBER: US 60/364,476 
PRIOR FILING DATE: 2002-03-15 
PRIOR APPLICATION NUMBER: US 60/375,626 



; PRIOR FILING DATE: 2002-04-26 

; PRIOR APPLICATION NUMBER: US 60/386,494 

; PRIOR FILING DATE: 2002-06-06 

; PRIOR APPLICATION NUMBER: US 60/390,965 

; PRIOR FILING DATE: 2002-06-24 

; PRIOR APPLICATION NUMBER: US 60/392,480 

; PRIOR FILING DATE: 2002-06-28 

; PRIOR APPLICATION NUMBER: US 60/394,128 

; PRIOR FILING DATE: 2002-07-03 

; PRIOR APPLICATION NUMBER: US 60/399,783 

; PRIOR FILING DATE: 2002-07-31 

; PRIOR APPLICATION NUMBER: US J50/403,221 

PRIOR FILING DATE: 2002-08-13 
; Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS: 62 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 22 
; LENGTH: 425 . 
TYPE: PRT 

ORGANISM: Homo Sapiens 
US-10-352-684A-22 



Query Match 99.8%; Score 2214; DB 14; Length 425; 

Best Local Similarity 99.5%; Pred. No. 1.4e-203; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



QY 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 




Db 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


Qy 


61 


LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 


Qy 


121 


VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 




Db 


121 


VI P YLQAVSVSVAVLTLS FIALDRW YAI CHPLLFKSTARRARGS I LGI WAVSLAIMVPQA 


180 


Qy 


181 


AVMECSSVLPELANRTRLFSVCDERWTUDDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 




Db 


181 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


Qy 


241 


KLWGRQiPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 


300 




1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 




Db 


•241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 


Qy 


301 


MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 


360 




1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 




Db 


301 


MWLLVFALCYLPI SVLNVLKRVFGMFRQASDREAVYACFTFSHWLVY7\NSAANPI I YNF 


360 


Qy 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 


420 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 M 1 




Db 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 


420 


Qy 


421 


TTVLP 425 




Db 


421 


1 1 1 1 1 

TTVLP 425 





RESULT 7 

US-09-826-509-549 

Sequence 549, Application US/09826509 
Publication No. US20030204073A1 
GENERAL INFORMATION: 
APPLICANT: Lehmann-Bruinsma, Karin 
APPLICANT: Liaw, Chen W. 
APPLICANT: Lin, I-Lin 

TITLE OF INVENTION: No. US20030204073Al-Endogenous , Constitutively Activated 
Known G 

TITLE OF INVENTION: Protein-Coupled Receptors 
FILE REFERENCE: AREN-2 07 

CURRENT APPLICATION NUMBER: US/09/826,509 
CURRENT FILING DATE: 2001-04-05 
PRIOR APPLICATION NUMBER: 60/195,747 
PRIOR FILING DATE: 2000-04-07 
PRIOR APPLICATION NUMBER: 09/170,496 
PRIOR FILING DATE: 1998-10-13 
NUMBER OF SEQ ID NOS : 589 
SOFTWARE: Patentin Version 2.1 
SEQ ID NO 549 
LENGTH: 425 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-826-509-549 

Query Match 99.5%; Score 2209; DB 10; Length 425; 

Best Local Similarity 99.3%; Pred. No. 4.3e-203; 

Matches 422; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 
I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I. I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 12 0 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 



I I I I I I I I I I I M I I I I II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 



180 



AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

II I I I M I M I I I I M I II I 1! II I I I I I I I I I I I I I II I II I II I I I I I I I M I I M I I 

AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMR?iRRKTAKML 300 
I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II II II III 

KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTKKML 300 

MWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYANSAANP 1 1 YNF 360 
I I I I I I I II I II I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I II II II 
MWLLVFALC YLPI S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYANS AAN PIIYNF 360 



Qy 



361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 

Qy 421 TTVLP 425 

I I I I I 

Db 421 TTVLP 425 



RESULT 8 

US-10-077-874-2 

; Sequence 2, Application US/10077874 
; Publication No. US20020115155A1 

GENERAL INFORMATION: 
; APPLICANT: Soppet, Daniel et al 

TITLE OF INVENTION: Human Neuropeptide Receptor 

NUMBER OF SEQUENCES: 12 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Human Genome Sciences, Inc. 

; STREET: 9410 Key West Avenue 

CITY: Rockville 
; STATE: MD 

; COUNTRY: USA 

ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTW/sJlE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/077, 874 
; FILING DATE: 20-Feb-2002 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/462,509 
; FILING DATE: 05-JUNE-1995 

ATTORNEY/ AGENT INFORMATION: 

NAME: Wales, Michele M. 
; . REGISTRATION NUMBER: 43,975 

; REFERENCE/DOCKET NUMBER: PF168P1D1 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 

TELEFAX: 301-309-8439 
; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 402 amino acids 

; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-10-077-874-2 

Query Match 95.0%; Score 2108; DB 13; Length 402; 

Best Local Similarity 100.0%; Pred. No. 1.9e-193; 

Matches 402; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

MM I I III II I Mill I Mill I II III III lllllll I II III II I II null Mil 



Db 



1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 



Qy 61 LVGNTLVCIAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGIJy^^ 240 

Qy . 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 



Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 



RESULT 9 
US-09-393-696-2 

Sequence 2, Application US/09393696 
Publication No. US2 0030022277A1 
GENERAL INFORMATION: 
APPLICANT: Human Genome Sciences, Inc. et al. 
TITLE OF INVENTION: Human Neuropeptide Receptor 
FILE REFERENCE: PF168P2 

CURRENT APPLICATION NUMBER: US/09/393,696 
CURRENT FILING DATE: 1999-09-10 
EARLIER APPLICATION NUMBER: PCT/US95/05616 
EARLIER FILING DATE: 1995-05-05 
EARLIER APPLICATION NUMBER: US08/462,509 
EARLIER FILING DATE: 1995-06-05 
NUMBER OF SEQ ID NOS : 23 
SOFTWARE: Patent In Ver. 2.0 
SEQ ID NO 2 
LENGTH: 402 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-393-696-2 

Query Match 94.1%; Score 2087; DB 10; Length 402; 

Best Local Similarity 99.0%; Pred. No. 2e-191; 

Matches 398; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 
Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIPAYVAVFWA 60 



Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSIJU5VLVTAICLPASLLVDITESWLFGHALCK 120 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVCLAWRNHHMRTVTNYFIVNLSIADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I i I M I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYIAPLGm^^ 24 0 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

Db 181 AVMECSSVLPELANRTRLFSVCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPR7VRAFIAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db .241 NLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 IWVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 

I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402 



RESULT 10 
US-09-730-931-2 

; Sequence 2, Application US/09730931 

; Patent No. US20020064814A1 

; GENERAL INFORMATION: 

; APPLICANT: ELLIS, CATHERINE E. 

; TITLE OF INVENTION: DOG OREXIN 1 RECEPTOR 

; FILE REFERENCE: GH-70669 

; CURRENT APPLICATION NUMBER: US/09/730,931 

; CURRENT FILING DATE: 2000-12-06 

; PRIOR APPLICATION NUMBER: 60/169,373 

; PRIOR FILING DATE: 1999-12-07 

; NUMBER OF SEQ ID NOS : 2 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 

LENGTH: 427 

TYPE: PRT 
; ORGANISM: CANIS FAMILIARIS 
US-09-730-931-2 

Query Match 94.1%; Score 2087; DB 9; Length 427; 

Best Local Similarity 94.1%; Pred. No. 2.2e-191; 

Matches 402; Conservative 5; Mismatches 18; Indels 2; Gaps 1; 

Qy 1 MEPSATPGAQMGVPPGSRE — PSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFV 58 

I I I I I I I I i I I I I I II I I II I II I I I I I II I I I I II I I I I I II I I I II I I I : 
Db 1 MEPSATPGAQTGTPTGGGELSPSLVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFL 60 

Qy 59 VALVGNTLVCLAVWRNHHMRTVTNYFIVNLSL7U:)VLVTAICLPASLLVDITESWLFGHAL 118 

I I I I I I I I l-l II I I II I II I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I M I I I I I I 
Db * 61 VALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHTL 120 



Qy 119 CWIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRT^GSILGIWAVSLAIMVP 178 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I : I I I 
Db 121 CKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARSSILGIWAVSLAVMVP 180 

Qy 179 QAAVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGLM^^ 238 

I I I I I I I I I I I I I I I I I I i I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 181 QAAVMECSSVLPEIJ^RTRLFSVCDEHWADDLYPKIYHSCFFIVTYLAPLGIJyr^ 240 

Qy 239 FRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAK 298 

I I I I I I I I I M I I I I I I I I I I I I I I I I I III II M II I I I I I II I I M I I I M I 

Db 241 FRKLWGRQIPGTTSALVRNWKRPSDQLEDQGPGLSAEPPPRARAFLAEVKQMRARRKTAK 300 



Qy 299 MIJWVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIY 358 

I I I I : I I II II II I II I II I I I I I I I I I II I : I I I I I I I I II II II I I I I II I I I II I I I 

Db 301 MLM\n^LLVFALCYLPISVLNVLKRVFGMFRQSSDREAWACFTFSHWLVYANSAANPIIY 360 

Qy 359 NFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLT 418 

II I I II I I II II I I II I I I I I II I I II II I II I I I I I II I I I I I I I I I I I : II M M 

Db 361 NFLSGKFREQFKAAFSCCLPGLGPCGSPKAPSPRSSASHKSLSLHSRCSVSKVPEHWLT 420 

Qy 419 SVTTVLP 425 

I I I I I I I 

Db 421 SVTTVLP 427 



RESULT 11 
US-09-828-538-20 

Sequence 20, Application US/09828538 
Patent No. US20010025031A1 
GENERAL INFORMATION: 
APPLICANT: Ellis, Catherine E. 
APPLICANT: Kwok, Cheni 
APPLICANT: Bodsworth, Nicola J. 
APPLICANT: Halsey, Wendy 
APPLICANT : Van Horn, Stephanie 

TITLE OF INVENTION: HFGAN72 Receptor Genomic DNA and Methods 
TITLE OF INVENTION: of Use Thereof in Diagnostic Applications 
FILE REFERENCE: GH-50038-C1 
CURRENT APPLICATION NUMBER: US/09/828,538 
CURRENT FILING DATE: 2001-04-06 
PRIOR APPLICATION NUMBER: 60/088,624 
PRIOR FILING DATE: 1998-06-08 
PRIOR APPLICATION NUMBER: 60/093,726 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: 09/328,014 
PRIOR FILING DATE: 1999-06-08 
NUMBER OF SEQ ID NOS : 24 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 20 
LENGTH: 389 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-828-538-20 



Query Match 85.7%; 
Best Local Similarity 96.1%; 
Matches 368; Conservative 



Score 1902.5; DB 9; Length 389; 
Pred. No. 9.7e-174; 
2; Mismatches 10; Indels 3; Gaps 2; 



Qy 

Db 



1 
1 



60 



60 



Qy 61 LVGNTLVCIAWRNHHMRTVTNYFIWLSLADVXVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 61 LVGNTLVCIAWRNHHMRTVTNYFIWLSLADVLWAICLPASLLVDITESWL 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I III I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

II I I I I I II I I I I I I I I I I I II I I II I I I I II I M I I II I I II I II I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVLISrVXKRVFGMFRQASDREAWACFTFSHWLWAN 360 

I I I M II I I II I I I I II M I II M II II I II M II I II I II I I II II I I I II I I II I II I 
Db 301 MVVLLVFALCYLPISVL^r\^.KRVFGMFRQASDREAVYACFTFSHWLWANSi^ 360 

Qy 361 LSGKFREQFKAAFSC-CLPGLGP 382 

III : : I I I I I I 

Db 361 LSG~CKEKSLALSCPSCPGHDP 381 



RESULT 12 
US-09-211-823C-23 

Sequence 23, Application US/09211823C 
Publication No. US20030087801A1 
GENERAL INFORMATION: 
APPLICANT: HAGEN, JAMES JOSEPH 
APPLICANT: TERRETT, JONATHAN ALEXANDER 
APPLICANT: UPTON, NEIL 
APPLICANT: PIPER, DAVID 
APPLICANT: SMITH, MARTIN IAN 
APPLICANT: KENNETT, GUY ANTHONY 
APPLICANT: PATEL, SARASWATI R. 

TITLE OF INVENTION: METHODS OF TREATMENT USING NOVEL LIGANDS 
TITLE OF INVENTION: OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR 
TITLE OF INVENTION: ANTAGONISTS THEREOF 
FILE REFERENCE: P5a745 

CURRENT APPLICATION NUMBER: US/09/211, 823C 
CURRENT FILING DATE: 1998-12-15 
PRIOR APPLICATION NUMBER: US 60/069,459 
PRIOR FILING DATE: 1997-12-15 
PRIOR APPLICATION NUMBER: US 60/069,785 
PRIOR FILING DATE: 1997-12-16 
NUMBER OF SEQ ID NOS: 23 

SOFTWT^E: FastSEQ for Windows Version 3.0 
SEQ ID NO 23 
LENGTH: 389 
TYPE: PRT 



; ORGANISM: HOMO SAPIENS 
US-09-211-823C-23 



Query Match 85.7%; Score 1902.5; DB 10; Length 389; 

Best Local Similarity 96.1%; Pred. No. 9.7e-174; 

Matches 368; Conservative 2; Mismatches 10; Indels 3; Gaps 2; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALC^ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I 

Db 61 LVGNTLVCLAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I i I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSIAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I 

Db 181 AVMECSSVLPEL7\NRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSC-CLPGLGP 382 

III :: III II I 

Db 361 LSG— CKEKSLALSCPSCPGHDP 381 



RESULT 13 
US-10-077-874-4 

Sequence 4, Application US/10077874 
Publication No. US20020115155A1 
GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al 

TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 
STREET: 9410 Key West Avenue 
CITY: Rockville 
STATE: MD 
COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 



APPLICATION NUMBER: US/10/077,874 
FILING DATE: 20-Feb-2002 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/462,509 
FILING DATE: 05-JUNE-1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Wales, Michele M. 
REGISTRATION NUMBER: 43,975 
REFERENCE/ DOCKET NUMBER: PF168P1D1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504 
TELEF7\X: 301-309-8439 
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 369 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
US-10-077-874-4 

Query Match 85.5%; Score 1897; DB 13; Length 369; 

Best Local Similarity 99.7%; Pred. No. 3.1e-173; 

Matches 362; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I 
MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I M I M I I 
LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

AVMECSSVXPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i 
AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M i I I I I I I I I I I I I I I I I I I I I I I I I 
KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
MWLLVFALC YLP I S VLNVLKRVFGMFRQASDREAVYACFT FSHWLVYANSAANP 1 1 YN F 360 



I I I 



Qy 


1 


Db ^ 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 



RESULT 14 
US-09-393-696-6 



; Sequence 6, Application US/09393696 

; Publication No. US20030022277A1 

; GENERAL INFORMATION: 

; APPLICANT: Human Genome Sciences, Inc. et al. 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
; FILE REFERENCE: PF168P2 

; CURRENT APPLICATION NUMBER: US/09/393,696 

; CURRENT FILING DATE: 1999-09-10 

; EARLIER APPLICATION NUMBER: PCT/US95/05616 

; EARLIER FILING DATE: 1995-05-05 

; EARLIER APPLICATION NUMBER: US08/462,509 

; EARLIER FILING DATE: 1995-06-05 

; NUMBER OF SEQ ID NOS : 23 

; SOFTWARE: PatentIn Ver. 2.0 

; SEQ ID NO 6 

; LENGTH: 372 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-393-696-6 



Query Match 85.4%; Score 1894; DB 10; Length 372; 

Best Local Similarity 99.4%; Pred. No. 6e-173; 

Matches 361; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I : M M I I I I I I I I I I I I I M I I M M I I I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSRDPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I M I i I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I M I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 15 
US-10-077-874-6 

; Sequence 6, Application US/10077874 
; Publication No. US20020115155A1 



GENERAL INFORMATION: 

; APPLICANT: Soppet^ Daniel et al 

; TITLE OF INVENTION: Human Neuropeptide Receptor 

; NUMBER OF SEQUENCES: 12 

CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 

CITY: Rockville 

STATE: MD 
; COUNTRY: USA 

ZIP: 20850 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentin Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/10/077,874 

; FILING DATE: 20-Feb-2002 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/462,509 
; FILING DATE: 05-JUNE-1995 

ATTORNEY/AGENT INFORMATION: 

NAME: Wales, Mi chele M. 
; REGISTRATION NUMBER: 43,975 

REFERENCE/DOCKET NUMBER: PF168P1D1 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 

TELEFAX: 301-309-8439 
; INFORMATION FOR SEQ ID NO: 6: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 372 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 

SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
US-10-077-874-6 

Query Match 85.4%; Score 1894; DB 13; Length 372; 

Best Local Similarity 99.4%; Pred. No. 6e-173; 

Matches 361; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I 1.1 I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSRDPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy ' 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 



Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I i I I I I I I I I I I I I I I I I i I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I i I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFQIFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



Search completed: October 14, 2004, 11:16:34 
Job time : 80.0952 sees 



I 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



October 14, 2004, 10:41:50 ; Search time 111.535 Seconds 

(without alignments) 
2192.441 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-070-532-2 

2219 

1 MEPSATPGAQMGVPPGSREP CSVSKISEHWLTSVTTVLP 425 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1825181 seqs, 575374646 residues 

Total number of hits satisfying chosen parameters: 1825181 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : UniProt_02:* 

1: uniprot_sprot : * 
2: uniprot_trembl : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


2218 . 


100. 


0 


425 


2 


Q9HBV6 


Q9hbv6 homo sapien 


2 


2214 


99. 


8 


425 


1 


OXIR HUMAN 


043613 homo sapien 


3 


2018.5 


91. 


0 


416 


1 


0X1R_RAT 


P56718 rattus norv 


4 


2015.5 


90. 


8 


416 


2 


Q6VNS3 


Q6vns3 mus musculu 


5 


2015.5 


90. 


8 


416 


2 


AAR01326 


Aar01326 mus muscu 


6 


1478.5 


66. 


6 


443 


2 


Q6VLX3 


Q6vlx3 mus musculu 


7 


1478.5 


66. 


6 


443 


2 


AAR01327 


Aar01327 mus muscu 


8 


1478.5 


66. 


6 


443 


2 


A7VR11294 


Aarll294 mus muscu 


9 


1478.5 


66. 


6 


460 


1 


0X2R MOUSE 


P58308 mus musculu 


10 


1478.5 


66. 


6 


460 


2 


AAR01328 


Aar01328 mus muscu 


11 


1478.5 


66. 


6 


460 


2 


AAR11293 


Aarll293 mus muscu 


12 


1474.5 


66. 


4 


46Q 


1 


0X2R_RAT 


P56719 rattus norv 


13 


1469 


66. 


2 


444 


1 


0X2R_CANFA 


Q9tup7 canis famil 


14 


1460 


65. 


8 


444 


1 


0X2 R HUMAN 


043614 homo sapien 


15 


1460 


65. 


8 


444 


2 


AAG28021 


Aag28021 homo sapi 



16 


1287 


58. 


0 


364 


2 


08BV78 




Tni 1 c! mi 1 Q ^*^ 1 1 1 1 
lULlO illLlO ^ux u. 


17 


1261. 5 


56. 


8 


260 


1 


OXIR MOUSE 


P58307 


mi3^ Tnijsr'ljlii 

Xll^pA 1»4 LA 


18 


976 


44 . 


0 


199 


2 


O80T45 


oft rn-4 ^ 


Tni 1 c! mi 1 c r^ii 1 11 
lllU 0 IWJl 0 Ul X U, 


19 


659. 5 


29. 


7 


166 


2 


08MJ13 




^Tri C AC! 

vjvxo cixxco 


20 


618 


27 . 


9 


127 


2 


08SPR4 


Oft c;r> r-4 


vjvxo dxxtJij 


21 


533 


24 . 


0 


109 


2 


Q8I010 


081010 




22 


522 . 5 


23 . 


5 


430 


1 


NFF1 HUMAM 


OQrf 'Trr^ 

yi^g zqo 


noino oapX6Il 


23 


512 


23. 


1 


417 


1 


NFF2 MOUSE 


OQ94hn 


iLLUk> lLlLli9^UXU. 


24 


508 . 5 


22 . 


9 


405 


2 




Q924n0 


mus musculu 


25 


507 . 5 


22 . 


9 


432 


3_ 


NFFI RAT 


Q9ep86 


rattus norv 


26 


499. 5 


22 . 


5 


522 


1 


NFF2 HUMAN 


Q9y5x5 


homo sapien 


27 


499 


22 . 


5 


417 


1 


NFF9 RAT 


Q9eqd2 


rattus norv 


28 


495 . 5 


22 . 


3 


399 


2 




Q75xu5 


gallus gall 


29 


495 . 5 


22 . 


3 


•O Zf Zf 


p 


RArft77 ft 9 


Bac87782 gallus ga 


■J \j 


4 ft n 

1 O 




a 
\j 


4 '^9 


9 

/L 


nQ94r;Q 

-7 ^ 0_7 


Q924g9 


rattus norv 


^1 






•a 




O 


07 YTT4 Q 


Q7yu49 


drosophila 


«-> ^ 




^ J. . 


n 


497 


1 

J. 


PfKR RARTT 


097772 


oryctolagus 


33 


452 . 5 


20 . 


4 


464 


2 


OQVRfl7 

^ _7 V JJ (J / 


Q9vb87 


drosophila 


34 


452 . 5 


20 . 


4 


464 


2 


AAFS66SS 


Aaf56655 drosophil 


35 


451 


20 


•J 


o o ^ 


1 


MY9R PTH 

IN X I\ JT X v7 


002836 


sus scrofa 


36 


447 S 




9 


44Q 


1 

J. 


I r\ UisSJirilii 


P25931 


drosophila 


37 


446 


20. 


1 


375 


2 


OS7Z6^ 

V> / T \J 0 


057463 


brachydanio 


38 


446 


20 . 


1 


517 


2 


^ V v» r\ J 


Q9vwr3 


drosophila 


O -7 


4 4 S 




1 
± 


J JO 


o 




Q7prc5 


anopheles g 


40 


444 


20 . 


0 


542 


2 


OQVRMO 


Q9vrm0 drosophila 


41 


444 


20. 


0 


542 


2 


AAF50775 


Aaf50775 drosophil 


42 


443.5 


20. 


0 


436 


2 


Q7T1P8 


Q7tlp8 


gallus gall 


43 


438.5 


19. 


8 


393 


2 


Q7T078 


Q7t078 


fugu rubrip 


44 


437 


19. 


7 


86 


1 


OXlR_PIG 


097661 


sus scrofa 


45 


436 


19. 


6 


370 


2 


Q6VMN6 


Q6vinn6 


mus musculu 



ALIGNMENTS 



RESULT 1 
Q9HBV6 

ID Q9HBV6 PRELIMINARY; PRT; 425 AA. 

AC Q9HBV6; 

DT Ol-MAR-2001 (TrEMBLrel. 16, . Created) 

DT Ol-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2004 (TrEMBLrel. 28, Last annotation update) 

DE Hypocretin receptor-1 (Orexin receptor 1) . 

GN Name=HCRTRl; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20429525; PubMed=10973318 ; 

RA Peyron C, Faraco J., Rogers W., Ripley B., Overeem S., Charnay Y. / 

RA Nevsimalova S., Aldrich M., Reynolds D., Albin R.,-Li R. , Hungs M. , 

RA Pedrazzoli M. , Padigaru M. , Kucherlapati M., Fan J., Maki R. , 

RA Lammers G.J., Bouras C, Kucherlapati R., Nishino S., Mignot E.; 

RT "A mutation in a case of early onset narcolepsy and a generalized 

RT absence of hypocretin peptides in human narcoleptic brains."; 



RL Nat. Med. 6:991-997(2000). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21580342; PubMed=l 172 3285; 

RA Olafsdottir B.R.^ Rye D.B.^ Scainmell T.E., Matheson J.K., 

RA Stefansson K., Gulcher J.R,; 

RT "Polymorphisms in hypocretin/orexin pathway genes and narcolepsy."; 

RL Neurology 57:1896-1899(2001) . 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Olafsdottir B.R., Stef ansdottir R.H., Sigurdsson A., Hannesson H.H., 

RA Sainz J., Scammell T.E., Stefansson K., Gulcher J.R.; 

RL Submitted (NOV-2001) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Yeager M., Welch R. , Haque K., Bergen A.; 

RL Submitted {DEC-2001) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Pooled tissue; 

RX MEDLINE=22388257; PubMed=12477932 ; 

^RA Strausberg R.L., Feingold E.A. ^ Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F.^ Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L,, Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M., Scares M.B., Bonaldo M.F.^ Casavant T.L., Scheetz T.E./ 

RA Browns tein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A. , Peters G.J. , Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P.J.^ McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C./Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X./ Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [6] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Pooled tissue; 

RA Strausberg R. ; 

RL Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF202084; AAG28020.1; -. 

DR EMBL; AF202078; AAG28020.1; JOINED. 

DR EMBL; AF202079; AAG28020.1; JOINED. 

DR EMBL; AF202081; AAG28020.1; JOINED. 

DR EMBL; AF202083; AAG28020.1; JOINED. 

DR EMBL; AF202082; AAG28020.1; JOINED. 

DR EMBL; AF202080; AAG28020.1; JOINED. 

DR EMBL; AY062030; AAL47214.1; 

DR EMBL; AY070269; AAL50221.1; -. 

DR EMBL; BC074796; AAH74796.1; 

DR GO; GO: 0016021; C: integral to membrane; lEA. 



DR GO; GO: 0016499; Fiorexin receptor activity; TEA. 

DR GO; GO: 0004872; F: receptor activity; lEA. 

DR GO; GO: 0001584; F: rhodopsin-like. receptor activity; TEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin. . .; lEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004059; Orexin_receptorl . 

DR Pfam; PFOOOOl; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PRINTS; PR01521; OREXINIR. 

DR PRINTS; PR01064; OREXINR. 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; UNKN0WN_1 . 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 425 AA; 47535 MW; B650B37F3A2CA096 CRC64; 



Query Match 100.0%; Score 2218; DB 2; Length 425; 

Best Local Similarity 99.8%; Pred. No. 1.4e~144; 

Matches 424; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLI/sJVYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGH7VLCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180* 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARR7VRGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I M I M I M I I I I M M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 M\AnjLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I : I I I I I I I I I I I I . 
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 

Qy 421 TTVLP 425 

I I I I I 

Db 421 TTVLP 425 



RESULT 2 
0X1R_HUMAN 

ID OXlR_HUMAN STANDARD; PRT; 425 AA. 

AC . 043613; 



DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel- 39, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Orexin receptor type 1 (Oxlr) (Hypocretin receptor type 1) . 

GN Naine=HCRTRl ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo, 

OX NCBITaxI D=9 6 0 6 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98150861; PubMed=9491897 ; 

RA Sakurai T., Amemiya A., Ishii M. , Matsuzaki I., Chemelli R.M., 

RA Tanaka H., Williams S.C., Richardson J. A., Kozlowski G.P., Wilson S., 

RA Arch J.R.S,, Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S., 

RA McNulty D.E., Liu W.-S., Terrett J. A., Elshourbagy N.A,, Bergsma D.J., 

RA Yanagisawa M. ; 

RT "Orexins and orexin receptors: a family of hypothalamic neuropeptides 

RT and G protein-coupled receptors that regulate feeding behavior."; 

RL Cell 92:573-585(1998). 

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=11340621; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M., Mignot E.; 

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RL Bioessays 23:397-408(2001). 

RN [3] 

RP REVIEW. 

RX MEDLINE=21178476; PubMed=11283317 ; 

RA Willie J.T., Chemelli R.M., Sinton CM., Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness . " ; 

RL T^nu. Rev. Neurosci. 24:429-458(2001). 

CC -!- FUNCTION: Moderately selective excitatory receptor for orexin-A 
CC and, with a lower affinity, for orexin-B neuropeptide. Seems to be 

CC exclusively coupled to the G(q) subclass of heteromeric G 

CC proteins, which activates the phospholipase C mediated signaling 

CC cascade (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC _ 

DR EMBL; AF041243; AAC39601.1; -. 

DR Genew; HGNC:4 848; HCRTRl . 

DR MIM; 602392; -. 

DR GO; GO: 0005887; C: integral to plasma membrane; TAS. 

DR GO; GO: 0004930; F: G-protein coupled receptor activity; TAS. 

DR GO; GO:0007631; P : feeding behavior ; TAS. 

DR GO; GO:0007218; P : neuropeptide signaling pathway; TAS. 

DR GO; GO:0007268; P: synaptic transmission; TAS. 



UK 


InterPro; 


IPR000276; GPCR_Rhodpsn. 


UK 


InterPro; 


IPR000204; Orexin_ 


receptor . 


UK 


InterPro; 


IPR004059; Orexin_ 


receptorl. 


UK 


Pfam; PFOOOOl; 7tir 


t 1; 1. 




DR 


PRINTS; PR00237; GPCRRHODOPSN. 


UK 


PRINTS; PR01521; OREXINIR. 




UK 


PROSITE; 


PSUUzi / ; 


r' tj n r\'T* t? x tvT 

b PKOibiN 


KniL-Ejir c 1 L , 1 . 


UK 


PROSITE; 


PSoUzDZ ; 


r* ■r>ri/^rn'c TXT 

G rKOihlN 




KW 


G-protein 


coupled 


receptor; 


Transmenibrane . 


FT 


DOMAIN 


1 




c •^■4— V*— ly-^z-kT Till t 0^+* Ai** 4 ^ T \ 

tiXuracexxuxar ^ jrOT-enuiax > • 


FT 


TRANSMEM 


47 


67 


X ^ FOcenuiaj. ; • 


FT 


DOMAIN 


68 


80 


Cytoplasmic (Potential) • 


FT 


TRANSMEM 


81 


102 


z vt^ouenuiax; . 


FT 


DOMAIN 


103 


119 


Extracellular (Potential) . 


FT 


TRANSMEM 


120 


142 


o (Fotenuiai; . 


FT 


DOMAIN 


143 


164 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 




185 


4 (Potential) . 


FT 


DOMAIN 


186 


216 


Extracellular (Potential) . 


FT 


TRANSMEM 


217 


239 


5 (Potential) . ♦ 


FT 


DOMAIN 


240 


298 


Cytoplasmip (Potential) . 


FT 


TRANSMEM 


299 


321 


6 (Potential) . 


FT 


DOMAIN 


322 


336 


Extracellular (Potential) . 


FT 


TRANSMEM 


337 


360 


7 (Potential) . 


FT 


DOMAIN 


361 


425 


Cytoplasmic (Potential) . 


FT 


CARBOHYD 


194 


194 


N-linked (GlcNAc, . .) (Potential; 


SQ 


SEQUENCE 


425 AA; 


47521 MW; 1634083DE10CA092 CRC64; 


Query Match 




99.8%; 


Score 2214; DB 1; Length 425; 



Best Local Similarity 99.5%; Pred, No. 2.6e-144; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy 


1 


MEPSAT PGAQMGVP PGSREP S PVP PD YEDEFLRYLWRDYL YP KQ YEWVLI AAYVAVFWA 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


Qy 


61 


LVGNTLVCIAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 


120 


Qy 


121 


VIPYLQAVSVSVAVLTLSFI7VLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 


180 


Qy 


181 


AVMEGSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


181 


AVMECSSVLPELANRTRLFSVCDERW7VDDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


Qy 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 


300 






lllllllllllllllllllllllllllllllllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 


Qy 


301 


MWLLVFALCYLPI SVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPI I YNF 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 U 1 1 1 1 1 1 1 1 M 1 1 1 1 1 




Db 


301 


MWLLVFALCYLPI SVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPI I YNF 


360 


Qy 


361 


LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 


420 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I 



Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHWLTSV 420 



Qy 421 TTVLP 425 
I I I I I 

Db 421 TTVLP 425 

RESULT 3 
0X1R__RAT 

ID 0X1R_RAT STANDARD; PRT; 416 AA. 

AC P56718; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Orexin receptor type 1 (Oxlr) (Hypocretin receptor type 1) . 

GN Name=Hcrtrl; 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=98150861; PubMed=94 91897 ; 

RA Sakurai T., Amemiya A., Ishii M. , Matsuzaki I., Chemelli R.M., 

RA Tanaka H., Williams S.C., Richardson J. A., Kozlowski G.P., Wilson S., 

RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S., 

RA McNulty D.E., Liu W.-S., Terrett J.A. , Elshourbagy N.A. , Bergsma D.J., 

RA Yanagisawa M. ; 

RT "Orexins and orexin receptors : a family of hypothalamic neuropeptides 

RT and G protein-coupled receptors that regulate feeding behavior."; 

RL Cell 92:573-585(1998). 

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=11340621; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M., Mignot E.; 

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RL Bioessays 23:397-408(2001). 

RN [3] 

RP REVIEW. 

RX MEDLINE=21178476; PubMed=11283317; 

RA Willie J.T., Chemelli R.M., Sinton CM., Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness."; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

CC -!- FUNCTION: Moderately selective excitatory receptor for orexin-A 

CC and, with a lower affinity, for orexin-B neuropeptide. Seems to be 

CC exclusively coupled to the G{q) subclass of heteromeric G 

CC proteins, which activates the phospholipase C mediated signaling 

CC cascade. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY; Highly expressed in the brain in the 

CC prefrontal cortex, hippocampus, paraventricular thalamus, 

CC ventromedial hypothalamus, arcuate nucleus, dorsal raphe nucleus, 

CC and locus coeruleus. Not detected in the spleen, lung, liver, 

CC skeletal muscle, kidney and testis. Orexin receptor mRNA 

CC expression has also been reported in the adrenal gland, enteric 



cc nervous system^ and pancreas. 

CC INDUCTION: By nutritional state, up-regulated by fasting. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 



CC 










DR 


EMBL; AF041244; AAC40041.1; 




DR 


RGD; 2787 


; Hcrtrl. 






DR 


InterPro; 


IPR000276; GPCR Rhodpsn. 


DR 


InterPro; 


IPR000204; Orexin_ 


receptor. 


DR 


InterPro; 


IPR004059; Orexin 


receptorl. 


DR 


Pfam; PFOOOOl; 7tm 1; 1. 




DR 


PRINTS; PRO 02 37; GPCRRHODOPSN . 


DR 


PRINTS; PR01521; OREXINIR. 




DR 


PROSITE; 


PS00237; 


G_PROTEIN_ 


_RECEP_F1_1; 1. 


DR 


PROSITE; 


PS50262; 


G_PROTEIN_ 


"rECEP_F1_2 ; 1 . 


KW 


G-protein 


coupled 


receptor; 


Transmembrane . 


FT 


DOMAIN 


1 


46 


Extracellular (Potential) . 


FT 


TRANSMEM 


47 


67 


1 (Potential) . 


FT 


DOMAIN 


68 


80 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


81 


102 


2 (Potential). 


FT 


DOMAIN 


103 


119 


Extracellular (Potential) . 


FT 


TRANSMEM 


120 


142 


3 (Potential) . 


FT 


DOMAIN 


143 


164 


Cytoplasmic (Potential) . 


FT 


TRT^^SMEM 


165 


185 


4 (Potential) . 


FT 


DOMAIN 


186 


216 


Extracellular (Potential) . 


FT 


TRANSMEM 


217 


239 


5 (Potential) . 


FT 


Day[AiN 


240 


298 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


299 


321 


6 (Potential) . 


FT 


DOMAIN 


322 


336 


Extracellular (Potential) . 


FT 


TRANSMEM 


337 


360 


7 (Potential) . 


FT 


DOMAIN 


361 


416 


Cytoplasmic (Potential) . 


FT 


CARBOHYD 


194 


194 


N-linked (GlcNAc. . .) (Po" 


SQ 


SEQUENCE 


416 AA; 


46799 MW; 774DE7A22EA05D18 CRC64; 



Query Match 91.0%; Score 2018.5; DB 1; Length 416; 

Best Local Similarity 92.0%; Pred. No. 7.2e-131; 

Matches 390; Conservative 8; Mismatches 17; Indels 9; Gaps 1; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I M I I I I III I II : I II II I I I I II II II II M I I II II I I I II I I I I :: I 
Db 1 MEPSATPGAQPGVPTSSGEPFHLPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFLIA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

1 1 II I II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 II I II 1 1 1 1 1 

Db 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II : I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAVMVPQA 180 



Qy 181 AVMECSSVLPEIANRTRLFSVCDERWADDLYPKIYHSCFFIVTYIAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPEIANRTRLFSVCDERWADELYPKIYHSCFFFWYIAPLGmGMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVT^KRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKM 300 

I I I I I I I I I I I I I I I I I I I I I : I I III I II M I II I II I I II II II I I I II II 

Db 241 KLWGPQIPGTTSALVRNWKRPSEQLEAQHQGLCTEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 IWVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I II I II I I I I I I I I M I II I I I I I I I I I I I I II I I I I I I I I II I I M I II I II I I I I I 

Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I I I I I I I I II I II I III I I I I I I I I I I II I I : I I I I I I I : I 

Db 361 LSGKFREQFKAAFSCCLPGLG PSSSARHKSLSLQSRCSVSKVSEHWLTTV 411 

Qy 421 TTVL 424 

I I I I 

Db 412 TTVL 415 



RESULT 4 
Q6VNS3 

ID Q6VNS3 PRELIMINARY; PRT; 416 AA. 

AC Q6VNS3; 

DT 05-JUL-2004 (TrEMBLrel. 27, Created) 

DT 05-JUL-2004 (TrEMBLrel. 21, Last sequence update) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last annotation update) 

DE Orexin receptor type-1. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; . Craniata; Vertebrata; Euteleostonii; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Chen J . , Rande va H . S . ; 

RL Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY336083; AAR01326.1; 

DR GO; GO: 0004872; F: receptor activity; lEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004059; Orexin_receptorl . 

DR Pfam; PFOOOOl; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN. 

DR PRINTS; PR01521; OREXINIR. 

DR PRINTS; PR01064; OREXINR. 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; UNKN0WN_1 . 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1 . 

KW Receptor. 

SQ SEQUENCE 416 AA; 46766 MW; A8958C594C365E00 CRC64; 

Query Match 90.8%; Score 2015.5; DB 2; Length 416; 

Best Local Similarity 92.0%; Pred. No. 1.2e-130; 

Matches 390; Conservative 8; Mismatches 17; Indels 9; Gaps 1 
Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



III II II III III I II :llllllllllllllllllllllllllllllllll::| 

1 MEPSATPGAQPGVPTSSGEPFHLPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFLIA 60 

61 LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLWAICLPASLLVDITESWLFGHALCK 120 
II II I II M I I I I I I M I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I II I I I I 

61 LVGNTLVCIAWRNHHMRTWNYFIWLSLADVLWAICLPASLLVDITESW 120 

121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

lllllllllllllllllllilllllllllllllllllllllllllllllllllhlllll . 

121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAVMVPQA 180 

181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I II I M I 11 I I I I I I I I I I I I I I : I I II I I M I I I I I I I M I II I I M II II I I 

181 AVMECSSVLPELANRTRLFSVCDEHWADELYPKIYHSCFFIVTYLAPLGLMGMAYFQIFR 240 

241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

II I I I I I I I I I I II I I I I I I I I : II III I I I I I I I I I I I I I I I I I I I I I I I I I I 

241 KLWGRQIPGTTSALVRNWKRPSEQLEAQHQGLCTEPQPRARAFIAEVKQMRARRKTAKML 300 



301 



360 



MWLLVFALCYLP I SVLNVLKRVFGMFRQASDREAVYACFTFSPMLVYANSAANP 1 1 YN F 
I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
301 MVVXLVFALCYLPISVLNVXKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

I I I I I I I I I I I I I I I I I I I I I I III I III II I Mill I 1:1111 I 11:1 

361 LSGKFREQFKAAFSCCLPGLG PGSSARHKSLSLQSRCSVSKVSEHWLTTV 411 

421 TTVL 424 

I I II 
412 TTVL 415 



RESULT 5 
AAR01326 
ID 
AC 
DT 
DT 
DT 
DE 



Created) 

Last sequence update) 
Last annotation update) 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi; Muridae; Murinae; Mus . 



7VAR01326 PRELIMINARY; PRT; 416 AA. 

AAR01326; 

02-'MAR-2004 (TrEMBLrel. 27, 
02-MAR-2004 (TrEMBLrel. 27, 
02-MAR-2004 (TrEMBLrel. 27, 
Orexin receptor type-1. 
OS Mus mus cuius (Mouse) . 
OC Eukaryota; Metazoa; Chordata; 
OC Mammalia; Eutheria; Rodentia; 
OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Chen J., Randeva H.S.; 

RT "Cloning and Characterization of the Mouse Type-1 Orexin Receptor 
RT (OXIR) . "; 

RL Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases. 
DR EMBL; AY336083; AAR01326.1; 
KW Receptor. 

SQ SEQUENCE 416 AA; 46766 MW; A8958C594C365E00 CRC64; 



Query Match 

Best Local Similarity 



90.8%; Score 2015.5; DB 2; Length 416; 
92.0%; Pred. No. 1.2e-130; 



Matches 390; Conservative 8; Mismatches 17; Indels 9; Gaps 1; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 

I I I I I I I I I I III I II : I I I I I I I I I I I I I I I I I I II II I I I I I II I I I I I :: I 
1 MEPSATPGAQPGVPTSSGEPFHLPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFLIA 



60 



60 



120 



61 LVGNTLVCIAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 
I I II II II II I I I 1! II I II I II I I II II I I I I II I I I I I I II I II I I II I I I II Mil 
61 LVGNTLVCIAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGQALCK 120 

121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARR7VRGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 
121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAVMVPQA 180 



181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 
llllllllllllllllllllllll llhllllllllllllllllllllll llllllll 
181 AVMECSSVLPELANRTRLFSVCDEHWADELYPKIYHSCFFIWYLAPLGLMGMAYFQIFR 

241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 

IIIIIIIIIIIIIIIIIIIMhIl III I I I I I I I I I I I I I I I I I I I I I I I I I I 

241 KLWGRQI PGTTSALAmNWKRPSEQLEAQHQGLCTEPQPRARAFLAEVKQMRARRKTAKML 



301 



301 



361 



MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 



240 



240 



300 



300 



360 



360 



420 



LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I : I I I I I I I : I 

361 LSGKFREQFKAAFSCCLPGLG PGSSARHKSLSLQSRCSVSKVSEHWLTTV 411 

421 TTVL 424 

I I I I 
412 TTVL 415 



RESULT 
Q6VLX3 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 

oc 
oc 
ox 

RN 
RP 
RC 
RA 
RL 
CC 
CC 
DR 
DR 



Q6VLX3 
Q6VLX3 ; 
05-JUL-2004 
05'-JUL-2004 
05-JUL-2004 



PRELIMINARY; 



PRT; 



443 AA. 



Created) 

Last sequence update) 
Last annotation update) 



(TrEMBLrel. 21, 
(TrEMBLrel. 27, 
(TrEMBLrel. 27, 
Orexin receptor type-2a. 
Name=M0XR2 ; 
Mus mus cuius (Mouse) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N.A. 

STRAIN=BALB/c; 

Chen J . , Rande va H . S . ; 

Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases. 

SUBCELLULAR LOCATION: Integral membrane protein (By similarity) 
-!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 
EMBL; AY339389; AAR11294.1; 
EMBL; AY339383; AAR11294.1; JOINED. 



DR EMBL; AY339384; AAR11294.1; JOINED. 

DR EMBL; AY339385; AAR11294.1; JOINED. 

DR EMBL; AY339386; AAR11294.1; JOINED. 

DR EMBL; AY339387; AAR11294.1; JOINED. 

DR EMBL; AY339388; AAR11294.1; JOINED. 

DR EMBL; AY336084; AAR01327.1; 

DR GO; GO: 0004872; F: receptor activity; lEA. 

DR InterPro; IPR000923; BlueCu_l. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004060; Orexin_receptor2 . 

DR Pfam; PFOOOOl; 7tm__l; 1. 

DR Pfam; PF03827; Orexin_rec2; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PRINTS; PR01522; 0REXIN2R. 

DR PRINTS; PR01064; OREXINR. 

DR PROSITE; PS00196; COPPER_BLUE; UNKN0WN_1 . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1; 1. 

DR PROSITE; PS50262; G__PR0TEIN__RECEP_F1_2 ; 1.. 

KW G-protein coupled receptor; Receptor; Transmembrane. 

SQ SEQUENCE 443 AA; 50559 MW; 794736A669463283 CRC64; 

Query Match 66.6%; Score 1478.5; DB 2; Length 443; 

Best Local Similarity. 69.7%; Pred. No. le-93; 

Matches 287; Conservative 48; Mismatches 68; Indels 9; Gaps 4; 

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 

::|| I l|:| I I I I I I I I : I I : I I : I I I I I I I h llllll:ll lll:|ll:|l 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 

I I I I I I I I I I I I I M I I I I I I I I I I : I : I I I I I : I II : I I I I I II I I I 11 II : II 
Db 84 HMRTVTNYFIWLSLADVLVTITCLPATLVVDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

III I I II I II II I II I : I II II : I I I II: II II I I : I I I I I I I I I : I I III: 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANK 203 

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I 1! : I II I I : : II I : II II I : I II : I II II : II I I I I I I I I I I I I I : I : 
Db 204 TTLFTVCDEHWGGEVYPKMYHICFFLVTYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWK--RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLP 313 

III : I I : I : : I I I I : M : I I I I M I : II I II I M I I : M I I 

Db 264 RKWKQQQPVSQ PRGSGQQSKARISAVAAEIKQIRARRKTARMLMWLLVFAICYLP 319 

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373 

I I : I I I I I I II I I I III ill I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 

Db 320 ISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379 

Qy 374 SCCLPGLGPCGSLKAPSPRSSASHKSLSLQ — SRCSVSKISEHWLTSVTTV 423 

I II I I I I : I I I I : I : : I M : I I II I I I I : : I : 

Db 380 SCCLGVHHRQGDRLARGRTSTESRKSLTTQISNFDNVSKLSEHWLTSISTL 431 



RESULT 7 
AAR01327 



ID 

AC 
DT 
DT 
DT 
DE 
OS 
OC 
OC 
OX 
RN 
RP 
RC 
RA 
RT 
RT 
RL 
DR 
KW 
SQ 



AAR01327 
AAR01327; 
02-MAR-2004 
02-MAR-2004 
02-MAR-2004 



PRELIMINARY; 



PRT; 



443 AA. 



Created) 

Last sequence update) 
Last annotation update) 



Craniata; Vertebrata; Euteleostomi; 
Sciurognathi; Muridae; Murinae; Mus. 



(TrEMBLrel. 21, 
(TrEMBLrel. 27, 
(TrEMBLrel. 27, 
Orexin receptor type-2a. 
Mus mus cuius (Mouse) . 
Eukaryota; Metazoa; Chorda ta; 
Mainmalia; Eutheria; Rodentia; 
NCBI_TaxI D=l 0 090; 
[1] 

SEQUENCE FROM N.A. 

STRAIN=BALB/c; 

Chen J . , Randeva H . S . ; 

"Cloning and Characterization of the Mouse Type-2a Orexin Receptor 
Subtype (0X2aR) •"; 

Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases. 

EMBL; AY336084; AAR01327.1; . 

Receptor. 

SEQUENCE 443 AA; 50559 MW; 794736A669463283 CRC64; 



Query Match 66.6%; Score 1478.5; DB 2; 

Best Local Similarity 69.7%; Pred. No. le-93; 
Matches 287; Conservative 48; Mismatches 68; 



Length 443; 
Indels 9; 



Gaps 



4; 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 

:: I I I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 

76 HMRTWNYFIWLSLADVLWAICLPASLLWITESWLFGHALCKVIPYLQAVSVSVAVL 135 

I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I : I II : II I I I II I I I I I I I : II 
84 HMRTVTNYFIVNLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

III I II II I I II I I I I : II I I I : I II II: II II I h I I I I I II II : II III: 
144 TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANK 203 

196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTS/y^V 255 

I 11:1111 I : : I I I : I I I I I : I II : I II I I : I I II I I I I I I I I I II : I : 
204 TTLFTVCDEHWGGEVYPKMYHICFFLVTYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQ 263 

256 RNWK — RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLP 313 

III : I I : I : : I I I I : I I : I I I I I I I : I I I I I I I I I I : I I I I 

264 RKWKQQQPVSQ PRGSGQQSKARISAVAAEIKQIRARRKTT^RMLMWLLVFAICYLP 319 

314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373 

I I : I I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 

320 ISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379 



Qy 



Db 



374 SCCLPGLGPCGSLKAPSPRSSASHKSLSLQ— SRCSVSKISEHWLTSVTTV 423 

I I I I I i I : I I I I : I : : I I I : I I I I I I I I : : I : 

380 SGCLGVHHRQGDRLARGRTSTESRKSLTTQISNFDNVSKLSEHWLTSISTL 431 



RESULT 8 
AAR11294 
ID AAR11294 



PRELIMINARY; 



PRT; 



443 AA. 



AC AAR11294; 

DT 02-iyiAR-2004 (TrEMBLrel. 21, Created) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last sequence update) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last annotation update) 

DE Orexin receptor type-2a. 

GN M0XR2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=B7VLB/c; 

RA Chen J . , Rande va H . S . ; 

RT "Genomic structure analysis of the Mus musculus orexin type-2 (M0XR2) 

RT gene . " ; 

RL Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY339383; AAR11294.1; JOINED. 

DR EMBL; AY339384; AAR11294.1; JOINED. 

DR EMBL; AY339385; AAR11294.1; JOINED. 

DR EMBL; AY339386; A7VR11294.1; JOINED. 

DR EMBL; AY339387; AAR11294.1; JOINED. 

DR EMBL; AY339388; AAR11294.1; JOINED. 

DR EMBL; AY339389; AAR11294.1; -. 

KW Receptor . 

SQ SEQUENCE 443 AA; 50559 MW; 794736A669463283 CRC64; 



Query Match 66.6%; Score 1478.5; DB 2; Length 443; 

Best Local Similarity 69.7%; Pred. No. le-93; 

Matches 287; Conservative 48; Mismatches 68; Indels 9; Gaps 4; 



Qy 


17 


SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 


75 




::|| 1 l|:| 1 1 1 1 1 1 1 1 : 1 1 : 1 1 : 1 1 1 1 1 1 1 l: lllllhll lll:|l|:ll 




Db 


24 


TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLI AGYI I VFWALI GNVLVCVAVWKNH 


83 


Qy 


76 


HMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 


135 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llll:|:|llll:| II HIIIIIIII 




Db 


84 


HMRTVTNYFIVNLSIADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 


143 


Qy 


136 


TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 


195 






III lllllllllllll:lllll:|ll II: II II ll:|ll llllll:|l III: 




Db 


144 


TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANK 


203 


Qy 


196 


TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 






1 11:1111 1 : : 1 1 1 : 1 1 II 1 : 1 1 1 : 1 II II : 1 1 1 1 II II 1 1 1 1 1 1 1 : 1 : 




Db 


204 


TTLFTVCDEHWGGEVYPKMYHICFFLVTYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQ 


263 


Qy 


256 


RNWK — RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLP 


313 






III :| 1 :l : : 1 1 1 1 : II : II 1 1 1 1 1 : 1 1 1 1 1 II 1 1 1 : 1 1 1 1 




Db 


264 


RKWKQQQPVSQ PRGSGQQSKARI SAVAAEI KQI RT^RKTARMLMWLLVFAI C YLP 


319 


Qy 


314 


ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 


373 






1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 : 1 1 1 1 1. 




Db 


320 


I S I LNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPII YNFLSGKFREEFKAAF 


379 


Qy 


374 


SCCLPGLGPCGSLKAPSPRSSASHKSLSLQ — SRCSVSKISEHWLTSVTTV 423 





I I I I I I I : I I I I : I : : I I I : I I I I I I I I : : I : 



Db 380 SGCLGVHHRQGDRLARGRTSTESRKSLTTQISNFDNVSKLSEHWLTSISTL 431 



RESULT 9 
0X2R_M0USE 

ID 0X2R_M0USE STANDARD; PRT; 460 hA. 

AC P58308; Q8BG12; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT lO-OCT-2003 (Rel. 42, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2). 

GN Naiae=Hcrtr2; Synonyms=Mox2r ; 

OS Mus musculus (Mouse). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Cerebellum, and Hypothalamus; 

RX MEDLINE=22354683; PubMed=12466851; DOI=10 . 1038/nature01266; 

RA Okazaki Y., Furuno M. , Kasukawa T., Adachi J., Bono H. , Kondo S., 

RA Nikaido I., Osato N., Saito R. , Suzuki H., Yamanaka I., Kiyosawa H., 

RA Yagi K. , Tomaru Y., Hasegawa Y., Nogami A., Schonbach C, Gojobori T 

RA Baldarelli R. , Hill D.P., Bult C, Hume D.A., Quackenbush J., 

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W., 

RA Blake J.A. , Bradt D., Brusic V., Chothia C, Corbani L.E., Cousins S 

RA Dalla E., Dragani T.A. , Fletcher C.F., Forrest A., Frazer K.S., 

RA Gaasterland T., Gariboldi M. , Gissi C, Godzik A., Gough J., 

RA Grimmond S., Gustincich S., Hirokawa N,, Jackson I.J., Jarvis E.D., 

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L., 

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P. A., 

RA Maglott D.R., Maltais L., Marchionni L. , McKenzie L., Miki H., 

RA Nagashima T., Numata K., Okido T., Pavan W.J., Pertea G., Pesole G., 

RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S., 

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M., 

RA Sandelin A., Schneider C, Semple C.A., Setou M. , Shimada K. , 

RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M. , 

RA Verardo R. , Wagner L. , Wahlestedt C, Wang Y., Watanabe Y., Wells C. 

RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M. , Yang I., Yang L., 

RA Yuan Z., Zavolan M. , Zhu Y., Zimmer A., Carninci P., Hayatsu N., 

RA Hirozane-Kishikawa T., Konno H., Nakamura M. , Sakazume N., Sato K., 

RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S., 

RA Kara A., Hashizume W., Imotani K., Ishii Y. , Itoh M. , Kagawa I., 

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A., 

RA Yasunishi A., Yoshino M. , Waterston R. , Lander E.S., Rogers J., 

RA Birney E., Hayashizaki Y. ; 

RT "Analysis of the mouse trans criptome based on functional annotation 

RT 60,770 full-length cDNAs."; 

RL Nature 420:563-573(2002). 

RN [2] 

RP SEQUENCE OF 100-311 FROM N.A. 

RC STRAIN=C57BL/6; 

RA Szendro P.I., Maevers K., Eichele G. ; 

RT "Cloning of mouse orexin receptors."; 

RL Submitted (JUN-2001) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP REVIEW, 



RX MEDLINE=21237974; PubMed=11340621 ; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M. , Mignot E. ; 

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RL Bioessays 23:397-408(2001). 

RN [4] 

RP REVIEW. 

RX MEDLINE=21178476; PubMed=11283317 ; 

RA Willie J.T., Chemelli R.M., Sinton CM., Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness . " ; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 
CC and orexin-B neuropeptides . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G~protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib- ch) . 

CC ; 

DR EMBL; AK038551; BAC30039.1; -. 

DR EMBL; AK0487B1; BAC33457.1; -.• 

DR EMBL; AF394597; AAK71327.1; -. 

DR MGD; MGI: 1889024; Mox2r. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004060; Orexin_receptor2 . 

DR Pfam; PFOOOOl; 7tm_l; 1. 

DR Pfam; PF03827; Orexin rec2; 1. 



DR 


PRINTS; PR00237; 


GPCRRHODOPSN. 


DR 


PRINTS; PR01522; 


0REXIN2R. 




DR 


PROSITE; 


PS00237; 


G_PROTEIN^ 


RECEP F1 1; 1. 


DR 


PROSITE; 


PS50262; 


g_protein] 


]rECEP_F1__2 ; 1. 


KW 


G-protein 


, coupled 


receptor; 


Glycoprotein; Transmembrane. 


FT 


DOMAIN 


1 


54 


Extracellular (Potential) . 


FT 


TRANSMEM 


55 


75 


1 (Potential) . 


FT 


DOMAIN 


76 


88 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


89 


110 


2 (Potential) . 


FT 


DOMAIN 


111 


127 


Extracellular (Potential) . 


FT 


TRANSMEM 


128 


150 


3 (Potential). 


FT 


DOMAIN 


151 


172 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


173 


193 


4 (Potential) . 


FT 


DOMAIN 


194 


224 


Extracellular (Potential) . 


FT 


TRANSMEM 


225 


247 


5 (Potential) . 


FT 


DOMAIN 


248 


304 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


305 


. 327 


6 (Potential) . 


FT 


DOMAIN 


328 


342 


Extracellular (Potential) . 


FT 


TRANSMEM 


343 


366 


7 (Potential). 


FT 


DOMAIN 


367 


460 


Cytoplasmic (Potential) . 


FT 


C7VRB0HYD 


14 


14 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


22 


22 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


202 


202 


N-linked (GlcNAc. . .) (Potential) 


FT 


CONFLICT 


201 


201 


A -> T (in Ref . 2) . 



FT CONFLICT 240 240 I -> V (in Ref . 2) . 

SQ SEQUENCE 460 AA; 52461 MW; D62A67C15BA67DCC CRC64; 



Query Match 66.6%; Score 1478.5; DBl; Length 460; 

Best Local Similarity 69.7%; Pred. No. l.le-93; 

Matches 287; Conservative 48; Mismatches 68; Indels 9; Gaps 4; 

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 

: : I I I I I : I I I I M I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 

II I I I I I I I I I I II I II I I I I M M : I : I I I I I : I II : I I II I I I I I I II I I : II 

Db 84 HMRTVTNYFIVNLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

III I II I II I II M I I : I II I I : I I I I I : II II 1 I : II I I I I I I I : I I III: 

Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIVVIWIVSCIIMIPQAIVMECSSMLPGLANK 203 

Qy 196 TRLFSVCDERWADDLYPKlYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I I : M I I I :: I I I : M I I I : I I I : I I I II : I I I I I I I I I I I I I I I : I : 

Db 204 TTLFTVCDEHWGGEVYPKMYHICFFLVTYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWK — RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLP 313 

III : I I : I : : I I I I : I I : I I I I I I I : I I I I I I I I I I : I I I I 

Db 264 RKWKQQQPVSQ PRGSGQQSKARISAVAAEIKQIRARRKTARMLMWLLVFAICYLP 319 

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373 

I I : I I I M I I I I M III III I I I I I I I I I I I I I I I I I II I I I I I II I I : I I I I I 

Db 320 ISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379 

Qy 374 SCCLPGLGPCGSLK7VPSPRSSASHKSLSLQ — SRCSVSKISEHWLTSVTTV 423 

I I I I I I I : I I I I : I : : I I I : I I I I I I I I : : I : 

Db 380 SCCLGVHHRQGDRLARGRTSTESRKSLTTQISNFDNVSKLSEHWLTSISTL 431 



RESULT 10 
AAR01328 



ID AAR01328 PRELIMINARY; PRT; 460 AA. 

AC AAR01328; 

DT 02-MAR-2004 (TrEMBLrel. 27, Created) 

DT 02-MAR-2004 (TrEMBLrel, 27, Last sequence update) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last annotation update) 

DE Orexin receptor type-2b. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Chen J., Randeva H.S.; 

RT "Cloning and Characterization of the Mouse Type-2b Orexin Receptor 

RT Subtype (0X2bR)."; 

RL Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases - 

DR EMBL; AY336085; AAR01328.1; -. 

KW Receptor- 



SQ SEQUENCE 460 AA; 52461 MW; D62A67C15BA67DCC CRC64; 



Query Match 66.6%; Score 1478.5; DB 2; Length 460; 

Best Local Similarity 69.7%; Pred. No. l.le-93; 

Matches 287; Conservative 48; Mismatches 68; Indels 9; Gaps 4; 



Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVXIAAYVAVFWALVGNTLVCLAVWRNH 75 

: : I I I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVEWALIGNVLVCVAVWKNH 83 

Qy 76 HMRTWNYFIWLSIJU3VLVTAICLPASLLVDITESWLFGH7VLCKVIPYLQAVSVSVAVL 135 

I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I : I II : I II I I I I II I I I I I : II 
Db 84 HMRTVTNYFIVNLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

III lllllllllllll:|llll:lll M: II II MUM IIIIIMM MM 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANK 203 

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I 11:1111 I ::|||:|| 111:111:111 II :|l lllllll llllll:l : 

Db 204 TTLFTVCDEHWGGEVYPKMYHICFFLVTYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 E^K — RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLP 313 

III : I I : I : : I I II : II : I I II II I : II I I II II I I : II II 

Db 264 RKWKQQQPVSQ PRGSGQQSKARISAVAAEIKQIRARRKTJDjyy[LMVVLLVFAICYLP 319 

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373 

II : II I I II I I II I III III II II I II I II I II II II I I I II II I II I : I II I I 

Db 320 ISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379 

Qy 374 SCCLPGLGPCGSLKAPSPRSSASHKSLSLQ— SRCSVSKISEHWLTSVTTV 423 

MM I I I: I III: I : : II I : II I M II I : : I : 

Db 380 SCCLGVHHRQGDRLARGRTSTESRKSLTTQISNFDNVSKLSEHWLTSISTL 431 



RESULT 11 
7^^11293 



ID AAR11293 PRELIMINARY; PRT; 460 AA. 

AC AAR11293; 

DT 02-MAR-2004 (TrEMBLrel. 27, Created) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last sequence update) 

DT 02-M7VR-2004 {TrEMBLrel. 27, Last annotation update) 

DE Orexin receptor type-2b. 

GN M0XR2 . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Chen J . , Rande va H . S . ; 

RT "Genomic structure analysis of the Mus musculus orexin type-2 (M0XR2) 

RT gene,"; 

- RL Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY339383; AAR11293.1; JOINED. 

DR EMBL; AY339384; AAR11293.1; JOINED. 



DR EMBL; AY339385; AAR11293.1; JOINED. 

DR EMBL; AY339386; AAR11293.1; JOINED. 

DR EMBL; AY339387; AAR11293.1; JOINED. 

DR EMBL; AY339388; AAR11293.1; JOINED. 

DR EMBL; AY339389; AAR11293.1; JOINED. 

DR EMBL; AY339390; AAR11293.1; -. 

KW Receptor. 

SQ SEQUENCE 460 AA; 52461 MW; D62A67C15BA67DCC CRC64; 

Query Match 66.6%; Score 1478.5; DB 2; Length 460; 

Best Local Similarity 69.7%; Pred. No. l.le-93; 
Matches 287; Conservative 48; Mismatches 68; Indels 



9; Gaps 



4; 



Qy 

Db 

Qy 

Db 



17 



24 



76 



84 



SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 

: : I I I I I : I I I I I I I M : I I : I I : I I I I I I I I : I I I I I I : I I M I : I I I : I I 
TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYI I VFWALI GNVLVCVAVWKNH 



75 



83 



135 



HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 

I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I : I II : I I I I II I I I II I II : II 
HMRTVTNYFIVNLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



136 



144 



196 



204 



TLSFIALDRWYAICHPLLFKSTARRi\RGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

IN I II M II I I M II : II I I I : I M II: II II I I : I I I I I I I II : I I III: 
TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANK 203 

TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

111:11111 :: I I I : II I I I : I I I : II I I I : I I II I I I I I I I II I I : I : 
TTLFTVCDEHWGGEVYPKMYHICFFLVTYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQ 263 



256 RNWK— RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLP 
IN : I I : I : : I I I I : I I : I I I I I I I : I I I I I I I I I I : I I I I 

264 RKWKQQQPVSQ— — PRGSGQQS KARI SAVAAEI KQI RARRKTARMLMWLLVFAI CYLP 



314 



320 



374 



380 



313 



319 



373 



ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 

I I : I I I I I I I I I I I III III I I i I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 

ISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379 



SCCLPGLGPCGSLKAPSPRSSASHKSLSLQ — SRCSVSKISEHWLTSVTTV 

MM I I I: I III: I : : II I : II II II I I : : I : 

SCCLGVHHRQGDRLARGRTSTESRKSLTTQISNFDNVSKLSEHWLTSISTL 



423 



431 



RESULT 12 
0X2 R RAT 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
DC 
OX 
RN 
RP 



0X2R_RAT STANDARD; PRT; 460 AA. 

P56719; 

30-MAY-2000 (Rel. 39, Created) 

30~MAY-2000 (Rel. 39, Last sequence update) 

05-JUL-2004 (Rel. 44, Last annotation update) 

Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2). 

Name=Hcrtr2; 

Rattus norvegicus (Rat) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 
NCBI_TaxID=10116; 
[1] 

SEQUENCE FROM N.A. 



Craniata ; Ver tebr ata ; Euteleos tomi ; 
Sciurognathi; Muridae; Murinae; Rattus. 



RC TISSUE=Brain; 

RX MEDLINE=98150861; PubMed-94 91897 ; 

RA Sakurai T., Amemiya A., Ishii M. ^ Matsuzaki I., Chemelli R.M., 

RA Tanaka H.^ Williams S.C., Richardson J. A., Kozlowski G.P., Wilson S., 

RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S., 

RA McNulty D.E., Liu W.-S.^ Terrett J.A. , Elshourbagy N.A., Bergsma D,J., 

RA Yanagisawa M. ; 

RT "Orexins and orexin receptors: a family of hypothalamic neuropeptides 

RT and G protein-coupled receptors that regulate feeding behavior."; 

RL Cell 92:573-585(1998). 

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=11340621; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M. ^ Mignot E.; , 

RT "Hypocretin/orexin^ sleep and narcolepsy."; 

RL Bioessays 23:397-408 (2001) . 

RN [3] 

RP REVIEW. 

RX MEDLINE=21178476;. PubMed=11283317 ; 

RA Willie J.T., Chemelli R.M. , Sinton C.M.^ Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness."; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 
CC and orexin-B neuropeptides . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Expressed in the brain in the cerebral cortex, 
CC septal nuclei, hippocampus, medial thalamic groups, dorsal and 

CC median raphe nuclei, and many hypothalamic nuclei including the 

CC tuber omammillary nucleus, dorsomedial hypothalamus, 

CC paraventricular hypothalamic nucleus, and ventral premammillary 

CC nucleus. Not detected in the spleen, lung, liver, skeletal muscle, 

CC kidney and testis. Orexin receptor mRNA expression has also been 

CC reported in the adrenal gland, enteric nervous system, and 

CC pancreas . 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC . 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC tfie European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb~sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC ^ 

DR EMBL; AF041246; AAC4p042.1; 

DR RGD; 2788; Hcrtr2. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004060; Orexin_receptor2 . 

DR Pfam; PFOOOOl; 7tm_l; 1. 

DR Pfam; PF03827; Orexin_rec2; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PRINTS; PR01522; 0REXIN2R. 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Glycoprotein; Transmembrane. 
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FT 


DOMAIN 


367 


460 


Cytoplasmic (Potential) . 


FT 


CARBOHYP 


14 


14 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


22 


22 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


202 


202 


N-linked (GlcNAc. . .) (Potential) 


SQ 


SEQUENCE 


4 60 AA; 


52489 


MW; 3B44E3D82F8B85D5 CRC64; 



Query Match 66.4%; Score 1474.5; DB 1; Length 460; 

Best Local Similarity 69.8%; Pred. No. 2e-93; 

Matches 286; Conservative 48; Mismatches 71; Indels 5; Gaps 3; 



Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 

::|| I ll:| I I I I I I I I : I I : I I : I I I I I I I |: lllllhll lll:|l|:ll 

Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 76 HMRTVTNYFIWLSLMVLWAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 

I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I : I II : I II I I I I II I I I I I : I I 

Db 84 HMRTVTNYFIWLSLADVLVTITCLPATLVVDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIT^DRWYAICHPLLFKSTARRARGSILGIWAVSLMMVPQAAVMECSSVLPELANR 195 

III I I I I I I I I II I M : I I I I I : I I I II: II II I I : I I I III I I : I I I I I : 

Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMERSSMLPGLANK 203 

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I I : I I I I II : : II I : I I I I I : I I I : I II II : I I II I I I I I I I I I I I : I : 

Db ,204 TTLFTVCDERWGGEWPKMYHICFFLWYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMNAH^LVFT^CYLPIS 315 

I I I : I I : I : : I I I I : I I : I I I I I I I : I I I I I I I I I I : I I I I I I 

Db 264 RKWKQP--QPVSQPRGSGQQSKARISAVAAEIKQIRARRKTAJRMLMVVLLVFAICYLPIS 321 



Qy 316 VLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSC 375 

: I M I I I I I I I I III III I I I I I I I I I I I I I I I II I I I I I I I I I I I : I I I I I II 

Db 322 IIJmjKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFK7\AFSC 381 

Qy 376 CLPGLGPCGSLKAPSPRSSASHKSLSLQ— SRCSVSKISEHWLTSVTTV 423 

II I I I : I I I I : I : : I II : I I I I I I I : : I : 

Db 382 CLGVHRRQGDRLARGRTSTESRKSLTTQISNFDNVSKLSEHVALTSISTL 431 



RESULT 13 
0X2R_CANFA 

ID 0X2R_CANFA STANDARD; PRT; 444 AA. 

AC Q9TUP7; 



DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2) . 

GN Name=HCRTR2 ; 

OS Canis familiaris (Dog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99385793; PubMed=10458611 ; 

RA Lin L., Faraco J., Li R., Kadotani H., Rogers W., Lin X., Qiu X., 

RA de Jong P. J., Nishino S., Mignot E. ; 

RT "The sleep disorder canine narcolepsy is caused by a mutation in the 

RT hypocretin receptor 2 gene."; 

RL Cell 98:365-376(1999). 

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=11340621 ; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M. , Mignot E. ; 

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RL Bioessays 23:397-408(2001). 

RN [3] 

RP REVIEW. 

RX MEDLINE-21178476; PubMed=112 83317 ; 

RA Willie J.T., Chemelli R.M., Sinton CM., Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness."; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

RN [4] 

RP VARIANT NARCOLEPSY LYS-54, AND MUTAGENESIS OF GLU-54. 

RX MEDLINE=2 118 0003; PubMed=112 82968 ; DOI=10 . 1101/gr. 161001; 

RA Hungs M., Fan J., Lin L., Lin X., Maki R.A., Mignot E, ; 

RT "Identification and functional analysis of mutations in the hypocretin 

RT (orexin) genes of narcoleptic canines."; 

RL Genome Res. 11:531-539(2001). 

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 

CC and orexin-B neuropeptides . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- DISEASE: Defects in HCRTR2 are a cause of an autosomal recessive 

CC form of narcolepsy, observed in labradors, dobermans and 

CC dachshunds. Narcolepsy is a neurological sleep disorder affecting 

CC animals and humans, characterized by excessive daytime sleepiness, 

CC sleep fragmentation, symptoms of abnormal rapid- eye-mouvement 

CC (REM) sleep, such as cataplexy, hypnagogic hallucinations, and 

CC sleep paralysis. Cataplexy is a sudden loss of muscle tone 

CC triggered by emotions, which is the most valuable clinical feature 

CC used to diagnose narcolepsy. As in humans, most cases of canine 

CC narcolepsy are sporadic but an autosomal recessive form was also 

CC observed. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 



CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http: //www. isb-sib, ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF164626; AAD49333.1; 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004060; Orexin_receptor2 . 

DR Pfam; PFOOOOl; 7tm_l; 1. 

DR Pfam; PF03827; Orexin_rec2; 1. ^ 

DR PRINTS; PR00237; GPCRRHODOPSN . 



DR 


PRINTS; PR01522; OREXIN2R. 




DR 


PROSITE; 


PS00237; 


\3 c KU i rj 1 N 


RECEP Fl 1 ; 1 . 


DR 


PROSITE; 


PS50262; 


G PROTEIN 


RECEP Fl 2; 1. 


KW 


Disease mutation; 


G-protein 


coupled receptor; Glycoprotein; 


KW 


Transmembrane . 






FT 


DOMAIN 


1 


C A 


Extracellular (Potential) . 


FT 


TRANSMEM 


55 


/ O 


1 (Potential) . 


FT 


DOMAIN 


76 


Q O 
O O 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


89 


110 


2 (Potential) . 


FT 


DOMAIN 


111 


127 


Extracellular (Potential) . 


FT 


TRANSMEM 


128 


150 


3 ( Potential) . 


FT 


DOMAIN 


151 


172 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


173 


193 


4 (Potential)/. 


FT 


DOMAIN 


194 


224 


Extracellular (Potential) . 


FT 


TRANSMEM 


225 


247 


5 (Potential) . 


FT 


DOMAIN 


248 


304 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


305 


327 


6 (Potential) . 


FT 


DOMAIN 


328 


342 


Extracellular (Potential) . 


FT 


TRANSMEM 


343 


366 


7 (Potential) . 


FT 


DOMAIN 


367 


444 


Cytoplasmic (Potential) . 


FT 


CARBOHYD 


14 


14 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


22 


22 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


202 


202 


N-linked (GlcNAc. . O (Potential) 


FT 


VARIANT 


54 


54 


E -> K (in autosomal recessive 


FT 








narcolepsy) . 


FT 


MUTAGEN 


54 


54 


E->K: Loss of function. 


SQ 


SEQUENCE 


444 AA; 


50675 MW; D848A4536D485D6B CRC64; 



Query Match 66.2%; Score 1469; DB 1; Length 444; 

Best Local Similarity 69.2%; Pred. No. 4.7e-93; 

Matches 286; Conservative 51; Mismatches 66; Indels 10; Gaps 5; 



Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 

::|| I ll:| I I I I I I I I : I I : I I : I I I I I I I |: lllllllll llhllhll 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALVGNVLVCVAVWKNH 83 

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 

I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I : I II : I II I II II I I I I I I : I I 
Db 84 HMRTVTNYFIVNLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

III I I I 11 I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I I I I :: II III: 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIVIIWIVSCIIMIPQAIVMECSTMLPGLANK 203 

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I 11:111111 ::|l|:|l lll:|ll:lll II :|l lllllll lllllhl : 



Db 



204 TTLFTVCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 



Qy 256 RNWK — RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLP 313 

III :|: I :| : : I I 11:11:1111111:1111111111:1111 

Db 264 RKWKPLQPASQ PRGPGQQTKSRISAVAAEIKQIRARRKTARMLM\A/LLVFAICYLP 319 

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373 

II : II I II II II II III III I I I I I I I I I I I II I I I I I I I II I I I I I I : I II I I 

Db 320 ISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSA7\NPIIYNFLSGKFREEFKAAF 379 

Qy 374 SCCLPGLGPCGSLKAPSPRSSA-SHKSLSLQ — SRCSVSKISEHWLTSVTTV 423 

III I: : l:l I III: I : : || | : I I I I I I I : : I : 

Db 380 SCCCLGVHHRQEDRLTRGRTSTESRKSLTTQISNFDNVSKLSEQWLTSISTL 432 

RESULT 14 
0X2R_HUMAN 

ID 0X2R_HUMAN STANDARD; PRT; 444 AA. 

AC 043614; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2) - 

GN Name-HCRTR2 ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98150861; PubMed=9491897 ; 

RA Sakurai T., Amemiya A., Ishii M. , Matsuzaki I., Chemelli R.M., 

RA Tanaka H., Williams S.C., Richardson J. A., Kozlowski G.P., Wilson S., 

RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A. , Annan R.S., 

RA McNulty D.E., Liu W.-S., Terrett J.A. , Elshourbagy N.A. , Bergsma D.J., 

RA Yanagisawa M. ; 

RT "Orexins and orexin receptors: a family of hypothalamic neuropeptides 

RT and G protein-coupled receptors that regulate feeding behavior."; 

RL Cell 92:573-585(1998). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21580342; PubMed-11723285 ; 

RA Olafsdottir B.R., Rye D.B.,, Scammell T.E., Matheson J.K., 

RA Stefansson K., Gulcher J.R.; 

RT "Polymorphisms in hypocretin/orexin pathway genes and narcolepsy ." ; 

RL Neurology 57:1896-1899(2001) . 

RN [3] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=11340621 ; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M. , Mignot E. ; 

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RL Bioessays 23:397-408 (2001) . 

RN [4] 

RP REVIEW. 

RX MEDLINE=21178476; PubMed=112 83317 ; 

RA Willie. J. T., Chemelli R.M. , Sinton CM., Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and . 



RT wakefulness."; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 
CC and orexin-B neuropeptides. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -I- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license(§isb-sib . ch) . 

CC ' 

DR EMBL; AF041245; AAC39602.1; 

DR EMBL; AY062031; AAL47215.1; -. 

DR Genew; HGNC:4849; HCRTR2 . 

DR MIM; 602393; -. 

DR GO; GO: 0005887; C: integral to plasma membrane; TAS. 

DR GO; GO: 0008188; F: neuropeptide receptor activity; TAS. 

DR GO; GO: 0007631; P: feeding behavior; TAS. 

DR GO; GO:0007218; P : neuropeptide signaling pathway; TAS. 

DR GO; GO:0007268; P:synaptic transmission; TAS. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004060; Orexin_receptor2 . 

DR Pfam; PFOOOOl; 7tm_l; 1. 

DR Pfam; PF03827; Orexin_rec2; 1. 

DR PRINTS; PRO 02 37; GPCRRHODOPSN . 

DR PRINTS; PR01522; 0REXIN2R. 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Glycoprotein; Transmembrane. 

FT DOMAIN 1 54 Extracellular (Potential) . 

FT TRANSMEM 55 75 1 (Potential) . 

FT DOMAIN 76 88 Cytoplasmic (Potential) . 

FT TRANSMEM 89 110 2 (Potential). 

FT DOMAIN 111 127 Extracellular (Potential). 

FT TRANSMEM 128 150 3 (Potential). 

FT DOMAIN 151 172 Cytoplasmic (Potential). 

FT TRANSMEM 173 193 4 (Potential). 

FT DOMAIN 194 224 Extracellular (Potential) . 

FT TRANSMEM 225 247 5 (Potential) . 

FT DOMAIN 248 304 Cytoplasmic (Potential). 

FT TRANSMEM 305 327 6 (Potential) . 

FT DOMAIN 328 342 Extracellular (Potential). 

FT, TRANSMEM 343 366 7 (Potential). 

FT DOMAIN 367 444 Cytoplasmic (Potential) . 

FT CARBOHYD 14 14 N-linked (GlcNAc. . .) (Potential). 

FT CARBOHYD 22 22 N-linked (GlcNAc. . .) (Potential). 

FT CARBOHYD 202 202 N-linked (GlcNAc. . .) (Potential). 

SQ SEQUENCE 444 AA; 50680 MW; CA0669F0D4224C65 CRC64; 



Query Match , 65.8%; Score 1460; DB 1; Length 444; 

Best Local Similarity 68.8%; Pred. No. 1.9e-92; 

Matches 284; Conservative 50; Mismatches 69; Indels 10; Gaps 



5; 



Qy 

Db 



17 
24 



SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 
::|| I I I I I I I I I : I I : M : I I.I I I I I |: II Nihil I I I : I II : I.I 

TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 



Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 

111111111111111111111 I I I I : I : I I I I I : I II : II I I I I I I I I I I I I : I I 
Db 84 HMRTVTNYFIWLSIJU)VLVTITCLPATLVVDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAWECSSVLPELANR 195 

III I I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I I I I : I I III: 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIVIIWIVSCIIMIPQAIVMECSTVFPGLANK 203 

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I 11:111111 : : I I I : I I I I I : I I I : I I I II : I I I I I I I I I I I II II : I : 

Db 204 TTLFWCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWK--RPSDQLGDLEQGLSGEPQPRARAFI^VKQMRARRKTAKMLMVVLLVFALCYLP 313 

III : I I : I : I I I I : I I : I I I I I I I : I I I I I I I I I I : I I I I 

Db 264 RKWKPLQPVSQ PRGPGQPTKSRMSAVAAEIKQIRARRKTARMLMVVLLVFAICYLP 319 

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373 

II : M I I I I I I I I I III III II I I II I I I I I I I I I I I II I I I I I I I I I : I I I I I 

Db 320 ISILNVLKRVFGMFAHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379 

Qy 374 SCCLPGLGPCGSLKAPSPRSSA-SHKSLSLQ — SRCSVSKISEHWLTSVTTV 423 

III I: : 1:1 I III: I : ::||:|l lllll::|: 

Db 380 SCCCLGVHHRQEDRLTRGRTSTESRKSLTTQISNFDNISKLSEQWLTSISTL 432 



RESULT 15 




AAG28021 




ID 


AAG28021 PRELIMINARY; PRT; 444 AA. 




AC 


AAG28021; 




DT 


02-MAR-2004 (TrEMBLrel. 27, Created) 




DT 


02-MAR-2004 (TrEMBLrel. 27, Last sequence update) 




DT 


02-MAR-2004 (TrEMBLrel. 27, Last annotation update) 




DE 


Hypocretin receptor~2. 




GN 


HCRTR2 . 




OS 


Homo sapiens (Human) . 




oc 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 


Euteleostomi ; 


oc 


Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 


Homo. 


ox 


NCBI TaxID=9606; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RX 


MEDL I NE=2 0 4 2 9 5 2 5 ; PubMed= 10973318; 




RA 


Peyron C, Faraco J., Rogers W.^ Ripley B., Overeem 


S . , Charnay Y. , 


RA 


Nevsimalova S., Aldrich M., Reynolds D., Albin R., Li R. , Hungs M, , 


RA . 


Pedrazzoli M., Padigaru M. , Kucherlapati M. , Fan J., 


Maki R,, 


RA 


Lammers G.J., Bouras Kucherlapati R. , Nishino S. 


, Mignot E. ; 


RT 


"A mutation in a case of early onset narcolepsy and 


a generalized 


RT 


absence of hypocretin peptides in human narcoleptic 


brains . "; 


RL 


Nat. Med. 6:991-997(2000). 




DR 


EMBL; AF202091; AAG28021.1; -. 




DR 


EMBL; AF202085; AAG2B021.1; JOINED. 




DR 


EMBL; AF202086; AAG28021.1; JOINED. 




DR 


EMBL; AF202087; AAG28021.1; JOINED. 





DR EMBL; AF202088; AAG28021.1; JOINED. 

DR EMBL; AF202089; AAG28021.1; JOINED. 

DR EMBL; AF202090; AAG28021.1; JOINED. 

KW Receptor. 

SQ SEQUENCE 444 AA; 50680 MW; CA0669F0D4224C65 CRC64; 

Query Match 65.8%; Score 1460; DB 2; Length 444; 

Best Local Similarity 68.8%; Pred. No. 1.9e-92; 

Matches 284; Conservative 50; Mismatches 69; Indels 10; Gaps 5; 

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIT^AYVAVFWALVGNTLVCLAVWRNH 75 

::|| I I I I I I I I I : I I : I I : I I I I I I I l: lllllhll lll:|ll:|| 

Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWLIAGYIIVFVVALIGNVLVCVAVWKNH 83 

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 

I i I I I I I I I I I I I I I I I I I I I llll:|:|llll:| II Mllllllll llllhll 
Db 84 HMRTVTNYFIVNLSLADVLVTITCLPATLVVDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy i36 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

III llllllllllllhllllhlll II: II II 11:111 llllhl I III: 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIVIIWIVSCIIMIPQAIVMECSTVFPGLANK 203 

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I I : I I II I I : : I I I : I I I I I : I I I : I I I II : I I I I I II I I I I IJ I I : I :' 
Db 204 TTLFWCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWK— RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLP 313 

III : I I : I : I I I I : II : II I II I I : I I II I II I II : I II I 

Db 264 RKWKPLQPVSQ PRGPGQPTKSRMSAVAAEIKQIRARRKTARMLMWLLVFAICYLP 319 

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANS7y\NPIIYNFLSGKFREQFKAAF 373 

I I : I I I I I I I I I I I III III I II I I I I I I I I II I I I I I I I I I I I I I I I : I I I I I 

Db 320 ISILNVLKRVFGMFAHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379 

Qy 374 SCCLPGLGPCGSLKAPSPRSSA-SHKSLSLQ — SRCSVSKISEHWLTSVTTV 423 

III I : : I : I I I I I : I : : : I I : .1 I I I I I I : : I : 

Db 380 SCCCLGVHHRQEDRLTRGRTSTESRKSLTTQISNFDNISKLSEQWLTSISTL 432 



Search completed: October 14, 2004, 10:55:47 
Job time : 113.535 sees 



